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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

DECLARATION OF TOD BEDILION, Ph.D. 
UNDER 37 C.F.R. § 1.132 

I, TOD BEDILION, Ph.D., declare and state as 

follows : 

1. In April, 1996, I became the first employee of 
Synteni, Inc., where I served as Research Director until its 
acquisition by Incyte Ccrp'ofation in early 1998. After 
Synteni' s acquisition, I continued in the position of Director 
of Corporate Development at Incyte until May 11, 2001. I am 
currently the Director of 'Business Development at Genomic 
Health, Inc., Redwood City, California and an occasional 
Consultant to Incyte. 

2. Synteni was founded to commercialize expression 
microarrays, microarrays in which expressed nucleic acids — 
full-length cDNAs, fragments of full-length cDNAs, expressed 
sequence tags (ESTs) — are arrayed on a common support to 
permit highly parallel detection and measurement of the 
expression of their cognate genes in a biological sample. 

3. During my employ at Synteni, virtually all (if 
not all) of my work efforts were directed to the further 
technical development and the commercial exploitation of that 
microarray technology; given the small size of our shop, most 
of us had both technical and commercial responsibilities. The 
customer accounts for which I was personally responsible 
included large pharmaceutical companies, such as SmithKline 



Beech,*, large biotechnology companies, such as Genentech, a 
small research institutes, such as DNAX Inc. 

4- From my very first interaction with our 
customers, consistently through to Synteni's acguisition by 
incyte, I heard uniform, consistent, and emphatic request/ 

that more genes be added to the arravs tm . 

e arra ys- This was true with 

respect to both our oricinai 

r original microarrays, based on customer- 
provided genes and lihr fl H OC _ j 

- na liberies, and our later, "generic", oene 

expression microarrays, based upon the unigene clone ' 
collection (our so-called -OaiG«- arrays, . From day , the 

pressure on us was to nr^t- ~ 

was to print ever more spots on the array It 

was never a Question- _ J ^ 

ik ^ e « 10 n. our customers wanted ever more genes on 

the array, each new gene-specific probe providing 
incrementally more value to the customer.- 1 

our custom! ' ^ ' Providing value to 

our customers was our major concern. Thus, to increase the 

value of our product <? sr,w o« 

procucts and services in the marketplace — to 

increase our ability to sen r,„r- 

y to sen our microarrays and microarray 
services, their "salabilitv" ~. ^ 

v . . saiaomty — our efforts from the very 

beginning were devoted to increasing th. „ k 

increasing the number of specific 

genes whose expression could be detected u,> h 

oe defected with our microarrays. 

S. Indeed, one of our major competitive advantages 

xn the marketplace nnr y 

P-ace not 3us t as regards other commercial 

suppliers, but also with resoecr m ^h* 

respect to the innumerable 
laboratories and cnmnani A * 

ana companies that were attempting to spo t arrays 

in their own "home-brew" facility u 

lacinties — was the number of 

I should note the emmi.«.. 

specific to only these genes for which tZt ^J"* for addition of P«*« 

encoded gene product was known ^logical function of the 

and all expressed genes ' * Were askin ? for Pr° b " specific to any 



distinct gene-specific probes that we provided on our 
expression microarrays . Our first 10,000 element UniGem array 
put the holy grail of gene expression analysis - the human 

whole genome array - within sight for the very first time 
(with respect to timing of the UniGEM program we began project 

planning and technology development in mid 1996 and delivered 

our first 10,000 element standard content human arrays in the 

first months of 1997 as I recall) . 

7. By the end of 1997, our efforts to provide the 
most comprehensive, and thus most valuable, human gene 
expression microarrays had been sufficiently successful that 
Incyte agreed to acquire Synteni for a reported $80 million. 

8. I declare further that all statements made 
herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true, and 
further that these statements were made with the knowledge 
that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 
Section 1001 of Title 18 of the United States Code and may 
jeopardize the validity of any patent application in which 
this declaration is filed or any patent that issues thereon. 




Tod Bedilion, Ph.D. Date 
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IN THE UNITED STATES PATENT AND TRADEMARK 



OFFICE 



DECLARATION OF VISHWANATH R. IYER, Ph D 
UNDER 37 C.F.R. § 1. 132 



follows : 



I, VISHWANATH R. IYER, Ph.D. , declare and state as 



1. I am an Assistant Professor in the Section of 
Molecular Genetics and Microbiology, institute of Cellular and 
Molecular Biology, University of Texas at Austin, where my 
laboratory currently studies global transcriptional control in 
yeast, gene expression programs during human cell 
proliferation, and genome-wide transcription factor targets in 
yeast and human. Immediately prio r to this position. I spent 
four years as a postdoctoral fellow in the laboratory of 
Patrick O. Brown at Stanford University studying the 
transcriptional programs of yeast and of human cells My 
curriculum vitae is attached hereto as Exhibit A. 

2. Beginning in Dr. Brown's laboratory, where I 
helped to develop the first whole genome arrays for yeast and 
early versions of highly representative cDNA arrays for human 
cells, and continuing to the present day, I have used 
microarray-based gene expression analysis as a principal 
approach in much of my research. 



3. Representative publications describing this 
work include: 



DeRisi J. et a J -pv«i^ 
genetic control of gene SrSf * 
scale. - Science 21g 9 : %l tSt^.r 1 3 9enom i= 

using » ,»icroa r ra y s, • -S^^'lf^' 

the response of 'iJ^f^SS 1 ^ .»"•>«■ ^ 
Science 283:83-87 (1999),' and ' " 

^"sTon^terns^fhlT^ variati °° in gene 
mature Se^icslS? 2^-2™" 1^" ^ 

Two of the papers describe our use of microarra.-based 
expression prof i lino a 

proriimg to explore the metabolic r*n™„ 

that occurs during .a.or environs! chan esTt h " 
-Kisi et a,, auring the shift fro, fer.entl 0 ^ 

~' "* ^ h ^ <*~ et a,., hlman 

fibroblasts exposed to serum) . One reference describes 
use of expression profile anal ysi s in drug targ " ^ a " 
- identification of secondar. drug effects , t 2 T, 
An one describes our use of expression P ro £i l ing as . ^ 
molecular phenotypino tool ^- 

cells (RoS s « aJ " —I hu^an cancer 



4. 



Whether used to eluci r^t-<=> • 

elucidate basic physiolocn r a i 

responses, to stud y pri Mry and seco„dar y drug ef ec 0 

discriminate and classify ' or to 

classify human cancers ovt-^^,.- 

cheers, expression profiling 
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Attached hereto as Exhibit a. 
Attached hereto as Exhibit C. 
Attached hereto as Exhibit D. 
Attached hereto as Exhibit e. 



as we have practiced it relies for • . 

«l cues tor its nowpr ~ 
patterns of expression. comparison of 

5 - For example, we have demonstrated . 
use the presence or absence of a ^ ° nStrated that we can 
a , . nCe of a characteristic druo 

Signature- pattern of altered o*n- 

secondary effects that can simil tC ldent «y 

side effects. As another J " T""* *- 

example, we have demon«s-r a f a j ^ 
gene expression patterns e -n k ae *>onscrated that 

^ erns can be used to clas^if,, u 
cell lines, while it is of „ classify human tumor 

j.t is of course advantao*»oi, e , 
biological function of the encode anta9eous <"> know the 

reach a b etter ^-nJLTZl^T™' " ^ " 

underlying these results, these It K meChaniSmS 
retire Pledge of the ^ZZTZlT^ * ^ 

proteins. function of the encoded 

6- The resolution of the patterns „..,, • 
orisons is detained oy the nuJr 1 n ^ ""f 
greater the nu^er of genes detected the „ , ^ 
resdution of the pattern It ^ ^ 

resolution patterns are generauT ^ UBh ~ 

comparisons than lover resT n ' * 

x evolution pattemc Ta7 • . , 

resolutions comes a corresoonH • , Wlt ^ such higher 

correspondingly hiaher decree of 
statistical confidence for . ■ °egree of 

aence for distinguishing different r^. 
as well as identifying similar ones. Patterns, 



7. 



Each gene included 



* «i^iuuea as a orobe rm =, - • 

provides a signal that i s specific tQ **** °» * —-ay 
at least to a f irst mr> PBClflc to the cognate transcript. 

a nrst approximation 5 p flrh „ 

n ' Each new gene -specific 

In a more nuanced view ir 
signal the presence of a variety of ^cTvar? -nf^ • Prob. to 

P " ' 8riants °f a single gene. 

(Continued...) 



probe added to a micrnarrau *-u„~ ■ 

a microarray thus increases the number of genes 

detectable by the device ■ ^ 

' lncrMsln 9 the resolving power of 

the device. As I note above. higher resolution patterns are 
generally ..ore useful in comparisons than lower resolution 
patterns. Accordingly, each new gene probe added to a 
microarray increases the usefulness of the device in gen. 
egression profiling analyses. This proposition is so well- 
established as to be virtually an axiom in the art. and JL 
been as long as I have been working in the field, and 
certainiy since the time X emharked on the production of whole 

gene-specific probes are inferior to arrays with more gene- 
specific probes. 

8. For example, our ability to subdivide cancers 
into discriminate classes >«, - cancers 
. ^ classes by expression profiling is limited 

by the resolution of the natt--™. J ted 

Patterns produced, with more genes 
contributing to th«» Bv «r.„.' ywes 
draw V expressi0n Patterns, we can potentially 

draw finer distinctions among the patterns, thus subdividing 
otherwise indistinguishable cancers into a greater nujr OI 
classes.- the greater the numher of classes, the greater the 
likelihood that the cancers classified together will respond 

IZIZ V° thSraPeUtiC tat «™-«- """"ting better 
individualization of theranv 

outcomes. W ^ better 

9. If a gene does not change expression in an 
experiment, or if a oene ic « 

a gene is not expressed and produces no 

(-Continued) 

without discriminating among them an n « 

of a variety of allelic vartan^fof a stZll ZT* "° Sign&1 the P*— ««• 
discriminating among them. single gene, again without 



signal in an experiment- fv, a *. • 

i-„v , , 18 n0t to say that the probe 

lacks usefulness on the 8 rr a » , *^ 

s on the array; xt only means that ^ 

insufficient number of <;«.,• 

,, of conditions have been sanpled to 

identify expression changes, m fact „„ 

that a nP nc 4 ' 311 ex P er *nient showing 

that a gene is not expressed or that its expression level does 
not change can be equally informative To r>roviH • 
versatility as a research tool the 

. . ^ t001 ' the microarray should 

ZTT " " d " 8 bl0l09i " 1 would want " "*«™y t= 

include - each „e„ ly identify sene as , profce 

h»i /°" * deCla " fUrthSr a " «"«"enes nade 

-* « .nfcr^tion rt belief m ™ 
further that these statempnhc 

th»f -,w , . statements were made with the knowledge 
that willful false statements and the like „ 

cne llj te so made are 

punishable by fine or- i'«mv^ 

y xine or imprisonment, or both, under 

Section 1001 of Title ip nf „ . 

Title 18 of the United States Code and mav 
jeopardize the valid! tv = y 
, , va ^ty of any patent application in which 

this declaration is fii-,* cn 

on is fn ed or any patent that issues thereon. 




October 20, 2003 
VISHWANATH R. IYER, Ph D 

Date 
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Vishwanath R. Iyer 

Assistant Professor 

Section of Molecular Genetics and Microbiology 

institute of Cellular and Molecular Biology 

MBB3.212A, University of Texas at Austin 

Austin, TX 78712-0159 

Phone: 512-232-7833 

Fax: 512-232-3432 

Email: vishy@mail.utexas.edu 

Education/Training 

Research Experience 

9/00-5/03 Assistant professor, Section of Molecular Genetics and 

Microbiology, University of Texas, Austin TX 

■ Global transcriptional control in yeast 

- Gene expression programs during human cell proliferation 

SSr 6 ^^ * anscri P tion f actor targets in yeast and human 

■ Collaborative microarray facility D 

5/96-8/00 Post- doctoral fellow Stanford University, Stanford CA 

(Advisor: Dr. Patrick O. Brown) 

" jfSSf n • mC 0RF and inter S e ™ microarrays 

Human cDNA microarrays for expression profiling 

9/89-4/96 o-dg. ^Harvard University, Ca mbri d g e MA 

■ Yeast transcriptional regulation 

Honours and Awards 

Government of India Biotechnology Fellowship (1987-1980) 
^^^F^Co^sAm Junior Research Fellowship (1989) 
Stanford Umversity/NHGRI Genome Training Grant (1996) 

Invited Conference talks (selected) 

Invited Lecturer, NEC-Princeton Lectures in Biophysics 
Princeton, NJ (June 1998) 

H 1SK£SS^ (HU00 Human Genome 
™^K^ n S arch Conference " Human Mo,ecular Genrtte " 



<*JS^nT- P j n | H f bor ^ratoiy Meeting (March 2003) 

Invited Speaker in FunctionaJ toomte (&„ e &w?|^ pnl f «*> . 

Invited Speaker "BioArrays Europe 2003" 
Cambridge, UK (Sep/Oct 2003) 

Departmental Seminars 

^ctoW If 2002 ^ BiOChemis ^ & "brtF- Departments, 

OT MayS m MCdiCal CCmer ' HUmaD Genetics Seminar Series, 

UCLA School of Medicine, Department of Human Genetics 
June 2 2003 

National Human Genome Research Institute 
June 12 2003 

Sanger Institute of the Wellcome Trust, Hinxton UK 
Sep 2003 ' 

Other Professional Activities 
Rev^er for Genome BiCogy, Genome Research. Nature Genetics, Science ( 199 8- 

^oT'^™ 8 Harb ° r $Ummet °"™ " Maki "S -0 DNA Microarrays" 
Member, NIDDK Special Emphasis Review Panel ZDKi (2001-2002) 

Publications 

1. JyexY & StruM, K. (i 995 ) Poly(dA:dT), a ubiquitous promoter element that 
annulate* transcnphon via its intrinsic DNA structure, £wlo ^"2570.2579. 

3 S^^^^ - * 



4 ' ^Slf" V lm}L£ ' & . Brown R °- C'997) Exploring the metabolic and Kinetic 
control of gene express™ on a genomic scale. Science 278:680-686 

arug target enects using DNA microarrays. Nature Med. 4:1293-1301 

6. Lutfi yya L. L. , tet^A, DeRisi J., DeVit M. J., Brown P. 0. & Johnston M fioofil 
Charactenzafcon of three related glucose repressors and genes fa 9 } 
Saccharomyces cerevisiae. Genetics 150:1377-1391 ^ regulate m 

7- Spellman P. T., Sherlock G., Zhang M. Q., JverV R Anders V f™ mod 
0., Botstein D. & Futcher B. (1998) CoijSsfc MSS&Srfi c^/ 0 ™ 

£3££ (3999) ^'^^^^ D - & 

fibroblasts to serum. Science 283:83-87 *»i*«»e 01 numan 

*" L ' & (1 " 9) Gen ° miCS Md am * Ctor. Qpm. Onco/. 

Nature Genetics 24: 227-235 expression P attera * human cancer cell lines. 
11. Sudarsanam P JverV R Brown P. 0. & Winston F. (2000) Whole-eenome 

9^3™^" ° {snf/swi mutants of & ~' ae - ^ ^SSS .(0S4) 

" SS£ - Chd?X D buddS'^ The ch ™° domain 

£MBO J 19: 2323-2331 ATP-dependent chromatin-modirying factor 



15- IverV. R.. Horak C, Scafe C. S Botstein n c n ,,/i a , w c » 

Genomic binding si es of the yeas? SSwfc S ^ & ^ P ' °' (2001 ) 
Mrture 409: 533-538 ^ transcnption factors SBF and MBF 



omaru Y., Carnind 
Tokusumi Y.. Kiku 



M., DeRisi J.L., Eisen M.B""lver V R nZTVn °™° mi Y ° Shiki A " Kusal «be 
Okazaki Y. & HayasWzaki Y SSn^' Munmrtni M. f Shimada H., 
pathvjays in viJby ^ 
ennched mouse cDNA arrays Proc. Natl. Acad. Sci^) %°L*t£™ ^ 

17 ' S1 J 3 ^pK 5 S° 02) CharaCteri2inS 

Manua/ feds. Bowtell, D & SaXook ? Wo Z ?nZ°f : * Mole ^r Cloning 
Press, 2003). fcamorook, J.) 453-463 (Cold Spring Harbor Laboratory 

*(not peer reviewed) 

19. Killion, P., Sherlock G. and Iver V R r«?nno"» tk« t ~ v 

20. Hahn J. S., Hu Z., Thiele D. J & Tver V R r onnma w j * 1 • 
Stress Responses Throueh Heaf sWVT,' Gen . ome "^ lde Analysis of the Bi 

v » nrougn Heat Shock Transcnption Factor (submitted to / 



Biology of 
PNAS) 



21. Kim J. & IverV.R. The global role of trp ro ™,i«.~ * . 

gene exp^efeofilef^S P«p" Pr ° m ° tere ****** 



Current/Pending Research Support 

U01 AA13518-01 Adron Harris (PI) 25% effort 
9/28/01 - 9/27/06 

NIH/NIAAA 

"INIA: Microarray Core" 

355&^ N ~ 6nCe *? ^ on Alcoholism 

to define changes in gene S 2£ PP< ? the USC ° f microa ^y technology 
consumption. 8 ex Pr«*">n that either predict or accompany excessive alcohol 

Role: Co-investigator 



003658-0223-2001 Iyer (PI) 16% effort 
01/01/02 - 08/31/04 

Texas Higher Education Coordinating Board (ARP) 

M-croarray based global mapping of DNA-protein interactions at promoter, in human 

This* a pilot project to map the in vivo interactions of transcription factors with human 
Role: PI 



WW^olwo?® 0325116 R ' M ™*y TO 9% effort 

NSF 

"Feedback from Multi-Source Data Mining +« r- • . 

Discovery" nmg t0 Experimentation for Gene Network 

Role: Co-investigator 

iVVo^-f^J/o'f (Pending) ^ (W) 25?< -fcrt 

NIH 

"Analysis of genome-wide transcriptional control in veast" 

£'S^5r to- in yeast through 

Role: PI 

335 ?12/31M AW3rd (PendinS) IyCr (PI) 10% effort 
US Army Medical Research and Materiel Command 

S^ecttS^^ 

human cells through the use JEXfcSSSS^ - » * 

003658-0531-2003 (pending) Marcotte (PI) 8% effort 
01/01/04-12/31/05 ^o/oenort 

Texas Higher Education Coordinating Board (ATP) 

££S5*~* P^tfonn for measuring gene mnction on a 

h^=gtp^ Plf orm for automated, 
of gene function. & g 1S> allowin g «Pid and systematic evaluation 
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1 fiK*w-Vbe,Sdanc» 270. 1828 (189$. * -< ' 
* T 3S T.C James and & c. Btfn. Mat Oaf dtt. 6. 3862 

fJL^** * 263 l** 1 * B - Tscrfcrsch ef at. 
EMBOJL 13.3822(1994): MLT fctefreotfefaLCaf 
«T. 75 41996* 0. a States, K.D. T«tof, R P. feny 
^DCNrttvtaiet Set USAiftt. 7137 (1996J. 
;,-36? P- 3 Patanarf er At. J. Sot Own 266. 10750 
^99l);A.SchrrtfcK.KGartefTan.j.fi 8Cter p 



S^^S*"*'* 1 S»*en, Moot** .68 
<«B 0992);..V. Stoma. K S^J^kS^ 

< ^Sacfanot 178. 3015 0996).. 
:37.,M.Ho«^.Caff.77. 86911994).' ' 

38. VVe thank k Skatetsky and F/Lewttter tor he* wtth 



'-j^^toc^j,, 1997 



,Exhib/t ; rB ; attached to/ptatirata<of Vbhwanath 
K lyer, Ph.D" " - - - v * ,o.\ 
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i, „ 



Exploring the Metabolic: and Genetic Control of 
Gene Expression on a Genomic Scale . 

■ i( Joseph^ -DeRisi, N^shwanath R.' Iyer, Patrick Brown 4 <u 



teearru'nii^''^^ cwitariing) virtually every gerie of Saccharomyces cerdrisiaewere Used 
to carry out a comprehensive invest cation of the temoorai J~ !7 
accompanying the metabolic shift from fJ2£S7S^SS^ 
profiles observed for genes with known metabolic funrti«nc S I J* ex P ress,on 
metabolicreprcsrammW^ fe * UreS °' the 

of many previously unchiacterized genes provS^ 

same DNA microarrays were also used to id^tr^ qenes^o^P o»n ■ ctlons - 
by deletion of the transcriptional co-^SmpT^^^Z^' 0 ?^ affected 
tional activator YAP1. These results SJS^^iSfSSS^^ fi? G * h 
proach to genomewide exploration of gene * * 8P ' 



The complete sequences of nearly a dozen 
microbial genomes are known, and in the 
next several years we expect to know the 
complete genome sequences of several 
metazoans, including the human genome. 
Defining the role of each gene in these 
genomes will be a* formidable task, and un- 
derstanding how the genome functions as a 
whole in the complex natural history of a 
living organism presents an even greater 
challenge. 

Knowing when and where a gene is 
expressed often provides a strong clue as to 
its biological role. Conversely, the pattern 
of genes expressed in a cell can provide 
detailed information about its state. Al- 
though regulation of protein abundance in 
a cell is by no means accomplished solely 
by regulation of mRNA, virtually all dif- 
ferences in cell type or state are correlated 
with changes in the mRNA levels of many 
genes. This is fortuitous because the only 
specific reagent required to measure the 
abundance of the mRNA for a specific 
gene is a cDNA sequence. DNA microar- 
rays, consisting of thousands of individual 
gene sequences printed in a high-density 
array on a glass microscope slide (J, 2), 
provide a practical and economical tool 
for studying gene expression on a very 
large scale (3-6). 

Saccharomyces cerevisiae is an especially 



Department of Biochemistry, Stanford University School 
of Medicine. Howard Hughes Medcal Institute. Stanford 
CA 94305-5428. USA. 1 

'To whom corresporiUtfKtt should be addressed. E-mafc 
porowne j ci i >gi i LStanlord.edu 
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favorable organism in which to conduct a 
systematic investigation of gene expression. 
I ne genes are easy to recognize in the ge- 
nome sequence, ris regulatory elements are 
generally compact and close to the tran- 
scription units, much is already known 
about its genetic regulatory mechanisms, 
and a powerful set of tools is available for its 
analysis. 

A recurring cycle in the natural history 
of yeast involves a shift from anaerobic 

£? enUUOn) t0 aerobic < rcs P ir «ion) me- 
tabolism. Inoculation of yeast into a medi- 
um nch m sugar is followed by rapid growth 
fueled by fermentation, with the production 
of ethanol. When the fermentable sugar is 
exhausted, the yeast cells tum to ethanol as 
a carbon source for aerobic growch. This 
switch from anaerobic growth to aerobic 
respiration upon depletion of glucose, re- 
ferred to as the diauxic shift, is correlated 
with widespread changes in the expression 
of genes involved in fundamental cellular 
processes such as carbon metabolism, pro- 
•ynthesis, and carbohydrate storage 
(/). We used DNA microarrays to charac- 
teme the changes in gene expression that 
take place during this process for nearly the 
entire genome, and to investigate the ge- 
netic circuitry that regulates and executes 
this program. 

Yeast open reading frames (ORFs) were 

?Sno( icd by * e P° l V mmsc cha «n reaction 
(PCR), with a commercially available set of 
primer pairs (8). DNA microarrays, con- 
taming approximately 6400 distinct DNA 
sequences, were printed onto glass slides by 



SCIENCE • VOL 278 • 24 OCTORpr ioo-j . 

*, 0 ^ v-a-. 1 uotR ]997 . www.sciencemag.org 



gluing a simple tobotic printing device (9). 
- Cells from an exponentially growing cujture 
of-yeast were inoculated into fresh medium 
^!**™" s 3WC*r 21 hours/Afef.an 
1 initial 9 noun of growth, samples were har- 
v « tc a^«ven successive 2.hour intervals, 

1 ^ fc** 1 ( Ruorescently 

labeled cDNA was prepared by reverse tnuv 

o^Cy3(green)- 
or Cy5(red)4abeled deoxyuridine triphos- 
phate (dUTP) <]]): and then^hybridized to 
; the microarrays (J 2). To maximae the re- 
tability with which changes .tn exi»ession 
levels could be discerned, we labeled cDNA 
prepared fromtoells at each successive time 
point with Cy5, then mixed it with a Cy3- 
labeled-reference" cDNA sample prepared 
from cells harvested at the first interval 
after inoculation.-In this experimental de. 
sign, the relative fluorescence intensity 
measured for the Cy3 and Cy5 fluors at 
each array element provides a reliable mea- 
sure of the relative abundance of the corre- 
sponding mRNA in the two cell popula- 
. tions (Fig. ^ 1). Data from 1 the series of seven 

f^wf (Fig ' 2)> consistin g more than 
^->,000 expression-ratio measurements, 
were organized into a database to facilitate 
efficient exploration and .analysis of the 
results. This database is publicly available 
on the Internet (13). 

During exponential growth in glucose- 
rich medium, the global pattern of gene 
expression was remarkably stable. Indeed, 
when gene expression patterns between the 
first two cell samples (harvested at a 2-hour 
interval) were compared, mRNA levels dif- 
fered by a factor of 2 or more for only 19 
genes (0.3%), and the largest of these dif- 
ferences was only 2.7-fold (HI However, as 
glucose was progressively depleted from the 
growth media during the course of the ex- 
periment, a marked change was seen in the 
global pattern of gene expression. mRNA 
levels for approximately 710 genes were 

' dkia i , 3 , faCl0r of at lcasl 2 > *e 
mRNA levels for approximately 1030 genes 

™ b T a , factor of al lca * 2 - Messenger 
RNA levels for 183 genes increased by a 
factor of at least 4, and mRNA levels for 
203 genes diminished by a factor of at least 
4. About half of these differentially ex- 
pressed genes have no currently recognized 
function and are not yet named. Indeed 
more than 400 of the differentially ex- 
pressed genes have no apparent homology 



to any gene whose function is known (25). 
TTie Te^h4e$7df .these *j^vio«sIy . uhehar- 
acteroed genes to file diauiic siiifY^ 
provides the first small clue to their possible 



• :■ T*}t global . view of changes in expres- 
sion of genes with known functions pro- 
vides a viVid picture of the in- which 
the "cell adapts to. a changing - environ- 
ment- Figure 3 shows a portion of the yeast 
metabolic "pathways' involved in carbon 
and ■ energy. ; metabolism. --Mapping^ the- 
changes. we observed .in.the mRNAs en- 
coding, each enzyme onto this framework* 
. , allowed us to infer the redirection in the ; 
flow of metabolites ■through this system. 
We observed large inductions of the genes 
coding for the enzymes aldehyde dehydro- 
genase 4ALD2) and : acetyl-cberuyme 
A(GoA) synthase^ (ACS! ); which ■ fiinc- 
tion together to convert the products of 
alcohol dehydrogenase into aceryl-CoA, 
which in turn is used to fuel the tricarbox- 
ylic acid (TCA) cycle and the glyoxylate 
cycle." The concomitant shutdown of tran- 
scription of the genes encoding pyruvate 
decarboxylase and induction of pyruvate 
carboxylase rechannels pyruvate away 
from acetaldehyde, and instead to oxalac- 
etate, where it can serve to supply the 
TCA cycle and gluconeogenesis. Induc- 
tion of the pivotal genes. PCKl t encoding 
phosphoenolpyruvate carboxykinase, and 
FBP/, encoding fructose l;6-biphos- 
phatase, switches the directions of two key 
irreversible steps in glycolysis, reversing 
the flow of metabolites along the revers- 
ible steps of the glycolytic pathway toward ' 
the essential bibsynthetic precursor, glu- 
cose-6-phosphate. Induction of the genes 
coding for the trehalose synthase and gly- 
cogen synthase complexes promotes .chan- 
neling of glucose-6-phosphate into these 
carbohydrate storage pathways. 

Just as the changes in expression of 
genes encoding pivotal enzymes can pro- 
vide insight into metabolic reprogram- 
ming, the behavior of large groups of func- 
tionally related genes can provide a broad 
view of the systematic way in which the 
yeast cell adapts to a changing environ- 
ment (Fig. 4). Several classes of genes, 
such asHeytochrome c-related genes and 
those involved in the TCA/glyoxylate cy- 
cle and carbohydrate storage, were coordi- 
nate^ induced by glucose exhaustion. In 
contrast, genes devoted to protein synthe- 
sis, including ribosomal proteins, tRNA 
synthetases, and translation, elongation, 
and initiation factors, exhibited a coordi- 
nated decrease in expression. More than 
95% of ribosomal genes showed at least 
twofold decreases in expression during the 
diauxic shift (Fig. 4) 113). A noteworthy 
and illuminating exception was that the 



genes encoding mitochondrial, ribosomal 
geries-were^geneTally induced ratKerthan 
; repressed after glucose limitation, high- 
lighting the requirement for mitchondria! 
; biogehesisfI3).; As more is learned about 
*e;functions;. 0 f eyerjr gene 0 in the'yeast 
genome, the ability, to gain insight into a 
^"V r «Pomc to a changing environment 
through its global gene expression patterns 
will become increasirigly f pbwefful. 
. Several distinct temporal patterns of ex- 
pression could > recognueoV and sets of 
genes could ^grouped on the basis of the 
similarities in their expression patterns. The' 
characterized inemrjers; of each of these 
^u^,also>hared:ini^rmt similarities in 
their :funaions: • Moreover, m. most cases, 
commcih regulatory mechanisms could be 
inferred for sets, of genes with similar expres- 
sion,- profiles; , For . example, seven genes 
showed a late induction profile, with mRNA 
levek increasing by more than ninefold at 



Reports 



the last tirnepoint iurkss than thieefoid at 
die preceding rin^irtt^{Fig.: 5B); VMi of 
these gerio ^ 

pressed, and five of the seven were previously 

M«'tDlih»vc»rim activat 
ing sequence; (UAS),^ the carbon source re- 
sponse elerherit (CSRE) r ( JcV?^ A^scarch 
in the promoter regions of the rernaining two 
genes, : ACRl and /DP2, revealed that 
ACRI, a gene essential for ACSi activity, 
also possessed a consensus CSRE motif but 
interestingly, JDP2 did not A search of the 
entire; yeast genome sequence for the con- 
sensus .CSRE morif revealed .only Tour addi- 
tional candidate genes, none of which 
showed a similar induction. 
. Examples from additional froiips of 
* cn « J hal v ^pressioh profiles are 

illustrated in Fig. ,5,; G through, F. 'The 
sequences upstream of the named genes in 
fig. 5C all contain stress response ele- 
ments (STRE), and witn the exception 




Fig. 1. Yeast genome microarray. The actual size of the microarrav « ir mm k„ -o rw 
microarray was printed as described (9). ThisTmaoe was o^S Ll t£ * 1 ! mm * 7)18 
sinning confocal microscope used to Volte* a MhfdaTa y^^^T^T fluorBSCent 

from the same culture 9.5 hours later (culture densitv of -2 x ine r^ricZi Tk ? . taten 
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of HSP42, have previously been shown to 
be controlled at least in pan by these 
elements (2J-24). Inspection of the se- 
quences upstream of HSP42 and the two 
uncharacterized genes shown in Fig. 5C, 
YKL026c, a hypothetical protein with 
similarity to glutathione peroxidase, and 
YGR043c. a putative cransaldolase, re- 
vealed that each of these genes also pos- 
sess repeated upstream copies of the stress- 
responsive CCCCT motif. Of the 13 ad- 
ditional genes in the yeast genome that 
shared this expression profile [including 
HSP30, ALD2 t OM45, and 10 uncharac- 
terized ORFs (25)), nine contained one or 
more recognizable STRE sites in their up- 
stream regions. 

The heterotrimeric transcriptional acti- 
vator complex HAP2,3 f 4 has been shown 
to be responsible for induction of several 
genes important for respiration (26-28). 
This complex binds a degenerate consensus 
sequence known as the CCAAT box (26). 
Computer analysis, using the consensus se- 
quence TNRYTGGB (29), has suggested 
that a large number of genes involved in 
respiration may be specific targets of 
HAP2 t 3,4 (30). Indeed, a putative 
HAP2,3 t 4 binding site could be found in 
the sequences upstream of each of the seven 
cytochrome c-related genes that showed 
the greatest magnitude of induction (Fig. 
5D). Of 12 additional cytochrome c-related 
genes that were induced, HAP2 t 3,4 binding 
sites were present in all but one. Signifi- 
cantly, we found that transcription of 
HAP4 itself was induced nearly ninefold 
concomitant with the diauxic shift. 

Control of ribosomal protein biogenesis 
is mainly exerted at the transcriptional 
level, through the presence of a common 
upstream-activating element (UAS^) 
that is recognized by the Rapl DNA-bin3- 
ing protein (3 J, 32). The expression pro- 
files of seven ribosomal proteins are shown 
in Fig. 5F. A search of the sequences 
upstream of all seven genes revealed con- 
sensus Rapl -binding motifs (33). It has 
been suggested that declining Rapl levels 
in the cell during starvation may be re- 
sponsible for the decline in ribosomal pro- 
tein gene expression (34). Indeed, we ob- 
served that the abundance of RAP] 
mRNA diminished by 4.4-fold, at about 
the time of glucose exhaustion. 

Of the 149 genes that encode known or 
putative transcription factors, only two, 
HAP4 and S1P4 , were induced by a factor of 
more than threefold at the diauxic shift. 
SIP4 encodes a DNA-binding transcrip- 
tional activator that has been shown to 
interact with Snfl, the "master regulator" of 
glucose repression (35). The eightfold in- 
duction of SIP4 upon depletion of glucose 
strongly suggests a role in the induction of 



downstream genes at the diauxic shift. 

Although most of the transcriptional 
responses that we observed were not pre- 
viously known, the responses of many 
genes during the diauxic shift have been 
described. Comparison of the results we 
obtained by DNA microarray hybridiza- 
tion with previously reported results there- 
fore provided a strong test of the sensitiv- 
ity and accuracy of this approach. The 
expression patterns we observed for previ- 
ously characterized genes showed almost 
perfect concordance with previously pub- 
lished results (36). Moreover, the differ- 
ential expression measurements obtained 
by DNA microarTay hybridization were re- 
producible in duplicate experiments. For 
example, the remarkable changes in gene 
expression between cells harvested imme- 
diately after inoculation and immediately 
after the diauxic shift (the first and sixth 
intervals in this time series) were mea- 
sured in duplicate, independent DNA mi- 
croarray hybridizations. The correlation 
coefficient for two complete sets of expres- 
sion ratio measurements was 0.87, and for 
more than 95% of the genes, the expres- 



sion ratios measured in these duplicate 
experiments differed by less than a factor 
of 2. However, in a few cases, there were 
discrepancies between our results and pre- 
vious results, pointing to technical limita- 
tions that will need to be addressed as 
DNA microarray technology advances 
(37 , 38). Despite the noted exceptions, 
the high concordance between the results 
we obtained in these experiments and 
those of previous studies provides confi- 
dence in the reliability and thoroughness 
of the survey. 

The changes in gene expression during 
this diauxic shift are complex and involve 
integration of many kinds of information 
about the nutritional and metabolic state 
of the cell. The large number of genes 
whose expression is altered and the diver- 
sity of temporal expression profiles ob- 
served in this experiment highlight the 
challenge of understanding the underlying 
regulatory mechanisms. One approach to 
defining the contributions of individual 
regulatory genes to a complex program of 
this kind is to use DNA microarrays to 
identify genes whose expression is affected 



Rq. 2. The section of the ar- 
ray indicated by the gray box 
in Rg. 1 is shown for each of 
the experiments described 
here. Representative genes 
are labeled. In each of the ar- 
rays used to analyze gene 
expression during the diauxic 
shift, red spots represent 
genes that were induced rel- 
ative to the initial timepoint, 
and green spots represent 
genes that were repressed 
relative to the initial timepoint. 
In the arrays used to analyze 
the effects of the tuplb mu- 
tation and YAP1 overexpres- 
son, red spots represent 
genes whose expression was 
increased, and green spots 
represent genes whose ex- 
pression was decreased by 
the genetic modification. Mote 
that distinct sets of genes are 
induced and repressed in the 
efferent experiments. The 
complete images of each of 
these arrays can be viewed on 
the Internet (13). Cell density 
as measured by optical densi- 
ty (OD) at 600 nm was used to 
measure the growth of the 
culture. 
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by mutations in each putative regulatory 
gene. As a test of this strategy, we analyzed 
the genomewide changes in gene expression 
that result from deletion of the TUP I gene. 
Transcriptional repression of many genes by 
glucose requires die DNA-binding repressor 



Migl and is mediated by recruiting the tran- 

Ssn6 (39). Tupl has also been implicated in 
repression of oxygen-regulated, rnating-type- 
Jgific, and DNA-damage-inducible genes 
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Rg- 3. Metabofic reprogramming inferred from global anah«R «i ,-h=r. 

metabolic intermediates are identified. The veal omaerS^TE! " 9606 expressioa ^ ke * 
in this metabolic circuit are identified by r«rrWfST^S en2ymes ,hat *«* «ep 

and glycogerKJebrancring enzyme^^ b^Si? K!^ 9 ^* 00 * s ^ ,hase 
YPR184 show sigrt.icarZrnoto^ lo^own S^^J^^^ 0 ^ YGR244 and 
zymes. respectively, end ere therefore included in»***~lZT?? flVcogen-debranching en- 
white lettering iden% m^^S^^Z^^TSf^ ™*«9ure. Red boxes with 
green tetterhg identify^enes wrK^TpSn^Sl^ «»■ Gree " with dark 
induction or repression te indicated foMhe^n^^t, * dlauxc ^ *>» magnitude of 
succinate dehydrogenase, the indicated fokTrSt, ™™Tt!f enZyme «wpi«*. such as 
genes listed in the box. Back and wt^Vr^^Srx! s^rn^ 9 ^ avera9e 01 8,1 
twofold). The erection of the arrows ccW^ ~^T«^^ m f^*"" 31 expression P«s than 
ttowof metabofcintermediates. irrferre^^o^ the direction of the 

representing steps catalyzed by oWwtase m^^^? ^ f ' aner ,he **• Arrows 
The broad Sn^rows^pre^n^ ™ h W<* in red. 

inferred trornfte irx^ted^^^r^P^n ° metab0Wes after «» shift. 
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Wild-type yeast celb and celb bearinc 
adeknon of the TUP, gene (tuplfi?% 
grown in parallel cultures in rich medium 
contamtng glucose as the carbon source. 
Messenger RNA was isolated from expo- 
nentially growing celb from the two dod- 
ulations and I used to prepare cDNA la- 
beled w,th Cy3 (green) and Cy5 (red) 
respectively (Jl). The labeled probe were 
mixed and simultaneously hybridized to 
the microarray. Red spots on the microar- 
my therefore represented genes whose 
transcription was induced in the rub J A 
strain and thus presumably repressed by 
1 upl (4/ ). A representative section of the 
microarray (Fig. 2, bottom middle panel) 
iHustrates that the genes whose expression 
was affected by the tupM mutation, were, 
in general, dtstinct from those induced 
upon glucose exhaustion [complete images 
of all the arrays shown in Fig. 2 are avatf- 

34 (10%) of the genes that were induced 
by a factor of at least 2 after the diauxic 
shift were similarly induced by deletion of 
TUP], suggesting that these genes may be 
sub,ect to TUPl-mediated repression by 
gUjcose. For example. SUC2, the gene en- 
coding invertase. and all five hexose trans- 
porter genes that were induced during the 
course of the diauxic shift were similarly 
induced, in duplicate experiments, by the 
deletion of TUP/. 

The set of genes affected by Tupl in this 
experiment also included a-glucosidases, 
^e mattng-typ^pecific genes MFAJ and 

a a " d mR << a* as genes involved 
m floccuUtion and many genes of unknown 
function. The hybridization signal com- 
sponding to expression of TUP J itself was 
also severely reduced because of the (in- 

I, a 1 "' 0 " ° f Ae tran *riPtion unit 
in the tuplA strain, providing a positive 
control in the experiment (42). 

Many of the transcriptional targets of 
Tupl fell mm sets of genes with related 
b.ochem.cal functions. For instance, al- 
though only about 3% of all yeast genes 
appeared to be TUP1 -repressed by a factor 

uLT,l " 2 J" duplicate «^«nent. 
under these conditions. 6 of the 13 genes 

that have been implicated in flocculation 

(15) showed a reproducible increase in 

expression of at least twofold when TUP J 

was deleted. Another group of related 

genes that appeared to be subject to TUPJ 

repression encodes the serine-rich cell 

wall I ^apoproteins, such as Tipl and 

T.rl^rpl which are induced by cold 

shock and other stresses (43). and similar. 

(44). Messenger RNA levels for 23 of the 

tt ge ."?L m . group were "Ptoducibly 
elevated by at least 2.5-fold in the tuplA 



nd 18 of these geno were induced YAP I ^ „ .... fWtW **»^^ 



«ram. and 18 of these genes were induced 
by more than sevenfold when TUP1 was 
deleted. In contrast, none of 83 genes that 
could be classified as putative regulators of 
the cell division cycle were induced more 
than twofold by deletion of TUPl Thus 
despite the diversity of the regulatory sys-' 
tons that employ Tupl. most of the genes 
that it regulates under these conditions 
tan into a limited number of distinct func- 
tional classes. 

Because the microarray allows us to 
monitor expression of nearly every gene in 
yeast we can. in principle, use this ap- 
proach to identify all the transcriptional 
targets of a regulatory protein like Tupl . It 
is important to note, however, that in any 
single experiment of this kind we can only 
recognue those target genes that are nor- 
mally repressed (or induced) under the 
conditions of the experiment. For in- 
stance, the experiment described here an- 

l A\i\l^\ a 5train in which MFAJ 
and MFA2, the genes encoding the a- 
ractor mating pheromone precursor, are 
normally repressed. In the isogenic tupl* 
strain, these genes were inappropriately 
expressed reflecting the role that Tupl 
plays m their repression. Had we instead 
carried out this experiment with a MATA 
strain (in which expression of MFAJ and 
MFA2 is not repressed), it would not have 
been possible to conclude anything re- 
garding the role of Tupl in the repression 
of these genes. Conversely, we cannot dis- 
tinguish indirect effects of the chronic 
absence of Tupl in the mutant strain from 
effects directly attributable to its partici- 
pation in repressing the transcription of a 
gene. 

Another simple route to modulating the 
activity of a regulatory factor is to overex- 
press the gene that encodes it. YAP J en- 
codes a DNA-binding transcription factor 
belonging to the b-rip class of DNA-bind- 
mg proteins. Over-expression of YAP J in 
yeast confers increased resistance to hydro- 
gen peroxide, o-phenanthroline. heavy 
metals, and osmotic stress (45). We ana- 
■y*d differential gene expression between a 
wiW-type strain bearing a control plasmid 
and a strain with a plasmid expressing YAP] 
under the control of the strong GALl-W 
Promoter, both grown in galactose (that is 
a condition that induces YAP1 overexpres-' 
1 ' C^P'ementary DNA from the con- 
trol and YAP] overexpressing strains, la- 
beled wirJ, Cy3 and Cy5, respectively, was 
Prepared from mRNA isolated from the two 
strains and hybridized to the microarray. 
i bus, red spots on the array represent genes 

Syapi induced ta * e strain owex i«"- 

Of the 1 7 genes whose mRNA levels 
increased by more than threefold when 



JAW f was overexpressed in this way, five 
bear homology to aryl-alcohol oxidoreduc- 

tour of the genes in this set also belong to 
«he general class of dehydrogenases/oxi- 
doreductases. Very little „ faJSTSt 
the role of aryl-alcohol oxidoreductases in 
L^"??' b , Ut ^ en ^"« have been 

ssris? ,i8nino,yiic » 

tiZ ^ C,pate ,n C0 "Pled redox reac- 
tions. ox.dtting aromatic, and aliphatic 
unsaturated alcohols to aldehydes wiSuhe 
production of hydrogen peroxide MM7) 
The fact that a remarkable fraction of ie 
targets identified in this experiment be! 
J»g to the same small, functional grSuptf 
oxidoreductases suggests that these genes 

Rfl. 4. Coordinated reg- 
ulation of functionally re- 
lated genes. The curves 
represent the average in- 
duction or repression ra- 
tios for all the genes in 
each indicated group. 
The total number of 
genes in each group was 
as follows: ribosomal 
proteins, 112; translation 

eton9ation and initiation , 

ttEE5SSStt.-S££i 52-* » 

^e-cycte enzymes, 24. 0356 80(3 reductase Proteins. 1 9; end TCA- and^y^ 



"^V of these genes encode su«r\2 

(TTAr^^JZ Vending «t« 
(TTACTAA or TGACTAA) in die Z 

^mined (48). About two-thirds of the 
El5* by more than 

t£t t U T , YapI w ««P«sion had 
uZl ^ indin e «<« within 600 base. 
UPJ» .of the start codon (Table 1), ^ 

ZTltl£l y ^ direcd V regulated^ 
*apl. The absence of canonical Yapl-bind- 
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OYE3 
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Description 

Putative ary^aicohoJ reductase 
Similarity to bacterial csgA protein 
Transcnptional activator involved in 

oxidative stress response 
Homology to aryt-alcohol 

dehydrogenases 
Putative glutathione transferase 

Putative aryl-aicohol reductase 
Amrotriazote and 4-nitroquinotine 
resistance protein 

Homology to benomyt/methotrexate 

resistance protein 
Hypothetical protein 

r^D^;^ 01 dehydrogenase 
iwdh dehydrogenase (ou yellow 

enzyme), isotorm 3 
H^togy to hypothetical proteins 

YCRl02c and YNLl34c 
H( ^?^ ,0 """apical protein 

NAO(P)H oxidoreductase (old yellow 
enzyme), isotorm 1 7 

S, SX 0A,/w/is ~»ta<rysta», 
Maiate dehydrogenase 



Fold- 
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12.9 
10.4 
9.8 

9.0 

7.4 
7.0 

6.5 
6.5 

6.1 

6.1 
6.0 
5.8 

4.7 

4.5 

4.1 

3.7 

3.3 



tag sites upstream of the others may reflect 
■natality of Yapl to bind sites that differ 
nom the canonical binding cites, perhaps in 
cooperation with other factors, or less like- 
ly, may represent an indirect effect of Yapl 
overexpression, mediated by one or more 
intermediary factors. Yapl sites were found 
only four times in the corresponding region 
of an arbitrary set of 30 genes that were not 
differentially regulated by Yapl. 

Use of a DNA microarray to character- 
ise the transcriptional consequences of 
mutations affecting the activity of regula- 
tory molecules provides a simple and pow- 
erful approach to dissection and character- 
ization of regulatory pathways and net- 



works. This strategy also has an important 
practical application in drug screening. 
Mutuant in specific genes encoding can- 
didate drug targets can serve as surrogates 
for the .deal chemical inhibitor or modu- 
tator of their activity. DNA microarray, 
can be used to define the resulting signa- 
ture pattern of alteration, in gene expVet 
sion, and then subsequently used in an 
assay to screen for compounds that repro- 

iW **" t6 "«»««« Pattern. 

. m 'croarrays provide a simple and 
economical way to explore gene expres- 
sion pattern, on a genomic scale. The 
hurdles to extending thi, approach to any 
other organism are minor. The equipment 
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required for fabricating and wine DNA 
microarray, (9) consist, of c«n?on«m 
that were chosen for their mod«c«^ 
«mplu:,ty. It was feasible for a small group 
jo accompluh the amplification of "morf 
than 6000 gene, in about 4 months and. 
once the amplified gene sequence, were in 
tand. only 2 days were required to print , 
set of 110 microarray, of 6400 elements 
f!jfl Preparation, hybridization, 

and fluorescent imaging are also simple 
procedures. Even conceptually simpk ex- 
periments, as we described here, can yield 
vast amounts of information. The value of 
the information from each experiment of 
this kind will progressively increase as 
more is learned about the functions of 

define 8 *! ?! 1* t^* 0 " 8 ' ex Pe"«nents 
siJnTn? 8,oba ' chan 8« « «ene expres- 
sion in diverse other natural processes and 
genetic perturbations. Perhaps the greatest 
challenge now is to develop efficient 
methods for organising, distributing, inter- 
preting, and extracting insight, from the 

X^ir of dara *- 
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vacuum oven. DNA samples from lO^J PtS™^ 
ton, were ourtSed by etharo. rJ^^S 

suspenoedn 3x standard same curate rssn mZt 

>w«(l cramber and then snacwjrieo k> ^a~,.h^ 
W "t 8 ""^" by subjecing the side* to 

the poty-t^ysme surtace was Uccked bvs VCni 
"cutation in a soWion of TOr^^^^L 

n^-2-pytroWnone (Ajaren) and 35 m of 1 M 
bone boo (pH 8.0). Drect>y after One btocltflg 
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tox the bound ONA was denatured by a 2-mh in- 
cubation in dotted water at -85*C. The sides ware 
then transferred irto s bath of 10Q% etftanoi at room 
tenperature, rinsed, and then spun dry in a oMcaf 
f****^ SWes ware stored in a dosed box at 
room temperature untt used. 
10. YPD merJum (8 Iters), in a 10-fter te m i en ialjun 
vessel, was inoculated with 2 ml of a fresh over* 
night culture of yeast strain DBY7266 (MATa, ura3. 
GAL2). The farmerrtor was maintained at 30^0 with 
constant agitation and aeration. The glucose con- 
tent of the meda was measured with a UV test Wt 
(Boehringer Mannheim, catalog number 716251) 
Cel density was measured by OD at 600-nm wave- 
length. Aiquots of cutture were rapioly withdrawn 
from the fe rm entati on vessel by peristaltic pump, 
spun down at room temperature, end then flash 
frozen wrth kxxJ nitrogen. Frozen ceits were stored 
at -BOX. 

11. Cy3-dUTP or Cy6-dl/TP (Amersharn) was hcorpo- 
rated during reverse transcription of 1.25 of 
po^denytated ipofytA)* ] RNA, primed byedT(i6) 
ofcgomer. This mixture was heated to 70*C for 10 
min, end then transferred to ice. A premrxed solu- 
tion, consisting of 200 U Superscript II (Gibco). 
buffer, deoxyribonudeoside triphosphates, and flu- 
crescent nucleotides, was added to the RNA. Nu- 
cleotides were used at these final concentrations- 
500 nM for dATP, dCTP, and dGTP and 200 uM 

for OTTP. Cy3-oVTP and CyS-dLTTP were used at 
a final concentration of 100 mJvI. The reaction was 
then incubated at A2*C for 2 hours. Unincorporat- 
ed fluorescent nucleotides were removed by first 
diluting the reaction mixture with of 470 mJ of 10 
mM tris-HO (pH 8.0)/1 mM EDTA and then subse- 
Quently concentrating the mix to —5 pi, using Cen- 
tricon-30 microconcentrators (Amicon). 
12. Purified, labeled cONA was resuspended in 1 1 ^ of 
3 -5 x SSC containing 1 0 m£ poryfdA) and 0.3 mJ of 
10% SOS. Before hybridization, the solution was 
boied for 2 min and then allowed to cool to room 
temperature. The solution was applied to the mi- 
croarray under a cover slip, and the sbde was 
placed in a custom hybridization chamber which 
was subsequently incubated for -8 to 12 hours in 
a water bath at 62"C. Before scanning, sfides were 
washed in 2x SSC. 0.2% SDS for 5 rrin. and then 
0.05x SSC for 1 min. Slides were dried before 
scanning by centrifugation at 500 rpm in a Beck- 
man CS-6R centrifuge. 

13. The complete data set is available on the internet at 
cmgii LStantord.ecWpbrowrVexplcye^ 

14. For 95% of al the genes analyzed, the mRNA levels 
measured in cells harvested at the first and second 
interval after inoculation differed by e factor of less 
than 1.5. The correiaUun coefficient for the compar- 
ison between mRNA levels measured tor each gene 
in these two different mRNA samples was 0.9a 
Wien duplicate mRNA preparations from the same 
eel sample were comp a red in the same way. the 
correlation coefficient between the expression levels 
measured tor the two samples by comparative hy- 
bridization was 0.89. 

15. The numbers and identities of known and putative 
genes, and their homologie s to other genes, were 
gathered from the tolowing public ****^mes: Sec- 
charornyces Genome Database (genorne-www. 
stantord.edu). Yeast Protein r^^arm (quest7 
prtfeorrw.com). and Munich Information Centre tor 
Protein Sequences (speedy. m ip s. biocherTijnpg.oe/ 
m^s^east/indexJitmfac). 

16. A. Sender and K J. Schular. Mot Cat Sot 14 
3613(1994). 

17. S. Kratzer and K J. Schuter, Gene 161. 75 (1995). 
16. R J. Hasetbeck and H. L McAfcter, J. Biol. Cham 

268.12116(1993). 
19. M. Fernandez. E. Fernandez. R. Roddo. Mot Gen. 

Genet 242.727(1994). 
2a A.Han^efa/..A^udafc>^«BS.20.5677 (1992L 

21. R M. Martinez ef el t EM80J. 15.2227(1996) 

22. J. C. Vareta. U. M. Praekeft. P. A. Meacock. R. j 
Ptenta,W.KMac^. Mot Cat Bbt 15.6232(1995) 

23 * KR ^ ar ^ c - Schuler, Btoessayz 17. 959 (1995). 
24. J. L Parrou. M. A. Teste. J. Francoi s , Aticfcojctoov 
143, 1691 (1997). —~vr 



25. Tho expresscn profle was defcned as havra an 
noucbon of greater than I04oto at 1*5 hours and 
less than 11-toto at 20.5 hours. 

5JL? )r5 ^^ LGuar ^- G ^^3.li66 
(1989). 

27. ^ T. Otesen and L Guansnte. fctt 4, 1714 (1990). 

28. M. Rosenkrantz, C. S. Kefl, E. A. PenneB, L J. Oe- 
oo y^[^^ M * =r °^- 13. 119(1994). 

29. SngJe-tetter abbreviations tor the amrvo acid resi- 
dues are ias tolows: A. Ata; C. Cys; 0. Asp; E. Glu; F. 

* J* *• «e: K. Lys; L Leu; M. MetN 
^*£* T Q > ^ R. Arg; S. Ser; T. Thr; V. VaJ; W. 
tij^r? ''udeotioe codes are as tosowsr 

B-C, G. or T; N-G. A, T, or C; R-A or G; and Y-C or 



T. 

^ £" Fon ?? t Kato 9eropoutos. Comput AppL 

fi«a 12.363(1996). 
31. 0. Shore. Trends Genet 10. 408 (1994). 
r?' ^y - Ptemaan d H. A. Raue. tod. 4. 64 (1988). 
' ^ a deoener 3te consensus sequence VYCYRNNC- 
MNH "?» "sed to search tor potential RAP1 -bndra 

£?n J^^i consensia ' 85 by m. is 

**^^ACATYW. with up to three cffler«x 
es aiowed. 

34 ' ^o h !!^o S ;, Bhattachaw J - a Broach, Mot. 
Ceff.Sot 15. 3187 (199$). 

^ Mrif? 896 * X Yans ' M * Caiso ^ ***. 16. 1921 
(1996). 

36. For example, we observed large nductions of the 
genes i coding for PCK7. FBPl \Z. Ym et at.. Mo/. 

20 - 751 0996)). the central gryoxytete 
cycle gene /a 7 (A. Sender and H. J Schuter 
Cu/r. Genet 23. 375 (1993JJ. and the "aerobkr 
©otcrnn of acetyt-CoA synthase. ACSl \M. A. van 
oen Berg ef al.. J. Btot. Chem. 271, 28953 (1996)1 
wrm cwTcomitant down-regulation of the glycoryt- 

M riZJ^' *^ 11 ' 5330 (1991 «- Other genes 
W^ectfy irTvolved rn carbon metabofism but 
w^own to be nduced upon nutrient limitation tn- 
u ^ neS encodin 9 cytosoic catalase T CTT1 
r^H. Bissmger ef a/., ibid. 9. 1309 (1969)] and 
several genes encoding small heat-shock proteins, 
such as HSP12. HSP26. and HSP42 (1. Farkasei 

Q7^n^ ; A ' Meacock - M * Ge"- Genet. 223. 
lU^^ et ^ J ^^ "1. 
37. The levels of induction we measured tor genes that 
!*^*^f*? at very tow levels in the uninouced 

nantoose previously reported. This discrepancy 
^^^Oueto the conservative background sut> 
used, whicn generally resulted in 
afl ^y? 10101 ^ »ow express^ teveis (46). 
36. C>os54iytxxtaaton ol h^hry reteted sequences can 
2» occasional obscure changes in gene expres- 
soOjan trrponant concern where members of gene 
ranges ^arefurxaionaOy specialized and drflerentiafly 

^^Tr^ ***** <tehyorcoenase genes, 
AOWTand ADH2. share 88% nucleotide identity. 
^^^^ocairegLriation ol tftese genes ts an imponanl 
mature of the diauxic shift, but was not observed in 
« experiment, presumably because 01 cross-hy- 
ondaabon of the fluorescent cDNAs representinQ 
J^ffJ* 0 genes - Nevertheless, we were able to oe- 
wcttffferential expression of closely related isof orms 
oiomer enzymes, such as HXKVHXK2 (77% iden- 

r^Fn^Zll 8 ' ' Yeast "- 137 (1995)]. MLSU 
QA17(73% rtentcal) (2d), and PGM1/PGM2 (72% 
J^caOP Oh, J. E. Hopper. Mo/. Cefl. Bk>!. 10, 
l<l S_n990ai.f i accord with previous studies. Use * 
xne "* c J°j8nTry of deliberately selected DNA se- 
W«nces corres ponding to the most divergent seg- 
ments of homologous genes, in beu of trw complete 
9ene sequences. shoukJ relieve this problem in many 



40. a Tzamarias and K. Stnjhl. rVerure 369. 758 (1994) 

41. C^ences in mRNA levels between the tupiA and 
wwype^rain were measured in two independent 
ewnmems.Thecon^to 
e«npiete sets of expression ratios measured r 
mese dupkeate experiments was 0.83. The concor- 
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^ Twee* of 

benouoedwesveryNght-, . mm 

J«»* Whan orty the 356 ganaa fax 
least a twofold increase in * 
h aJhar of the ducacsM 
P*** the conatabon coafioent teas 082. 

4,2 Z^r^Si^Jr^? 00 ^^^^L - " 4rmaertior> of tt« 
J»« codng seouenos. txajring a stop oodoabs* 
tweentheATBof 7LP7 and an toRlilo?lSoiM 
PW before the stop oodon of the 7LP7 gjene. 

41 ii^Zm."- ^ ^ ^ ***** 

Y- S. Cong. J. 

45. D Hirata, K. Yano. T. Myakawa. A4ol Gen. Genet 
242.250(1994). *««*uen. oenet 

^ a" T*^!II2Lt ^ ararn *to. A. Prteto, M. J. Martinez. 
0994)^ rt>n6Z ' APPL EnVm ' * faotooi M * ^783 

la! ^ * ^ ' ^ ^ gDoftarn 1 369 (1 
l!x^^ m^n,e, Szctypka, D. J. Thiefe, W. S 
IT^^-Ko^ey. J. &ot Cham 269, 32592 (1994k 

^SSI^^ 8 !!^!^ • custonvouli 
scanntn 9 tear mci oa uu p e bult by S. Smfth with 
software written by N. Zrv. Oetais ccflcernng scan- 
n^es^andcons^ 

r^^^f* 0 *"- scanned it a 

resc *rt J on of 20 urn per pbcai. A separate scan. usinQ 

me appropriate excitation ir^ was done lor each of 

cess, the ratio between the sonars hi the two t^an* 

^^ f ^^^Ba^aff 9 yalafTmu^ 
^^Jf^^CNA. To normakce the two 
channels with respect to overaR fcteredty, we than 
«*«ted photomultipfer and laser power settra 
«h that the signal ratio at these elements was as 
fj^fj^ 1 .0 as pass**. The oorrt)inedrnaoeswere 
analyzed with custorn-wrttten software. A boundra 
txaoi, fitted to the size of the DNA spots h each 
Quadram, was placed over each array element The 
averao^fkjrjrescent intensity was calculated by sum- 
^nojherttenstoes* 

«gbox. and then drvidvig by the total number of 
P«8ts. Local area background was rai^*^ ^ 
eachanay elemem by detemw^ 

^^5 XXJ8h ^ metnod 16005 to "ncJerestimate 
^>e oackground. causing an unoeresbrnation of ex> 
treme rate. It produces e very co nsist en t end noise* 
l^^- appr ° xrnat ^ Although the anatoo-to- 
dgal board used for data coiecton possess Ta 
wide dynamic range (12 bits), several signals were 

tt^^^H^ 8eten » s - therefore, extreme 
ratios at bright elements are generaty underesttr*. 
eo. Asjgnal was deemed significant If the evaraoe 
r52f 2? ******* subtraction was at least 
iZT* 0 h| fl her tn *n the standard deviation w\ the 
Dackground measurements tor al etomentsonthe 
array. 

50. m addrtion to the 17 genes shown ri Table 1 three 
**°*onal genes were induced by an average of 
more than threefold in the duplicate expenrrientr but 
in one ofi tpe two experiments, the induction was less 
than twofold (range 1.6- to 1.9-fold) 

51. We thank H. Bennett. P. Soeflman, J. Ravetto. M. 

bars of the Brown lab for their assistance and hetofii 
^■^^ Fnend. a BotsteioS 

Smth. J. Hudson, and D. Dokjinow for advice sur> 

^^Tf X ??! m f n: ^* Strutu and S. ChatteV- 
EL^ Tup1 detet,on L Femanoes for 
helpful advice on Yapi; and S. Klaphotr end the 
fevtewers for many helpful comments on themanu- 
senpt. Supported by a grant from the National Hu- 
man Genome Research Institute flsiMStan 
WCO450). and by the Howard Hu^MaSESL? 
stitute (HHMl). J.D.R. was supported byTeHHVB 
and the NHGRI. V.R. was supported ^ byi 
Instrtutional Training Grant in Genome Science ( T32 
HG00044, from the NHGRI. P.O.B. fe^s^cS 
nvestigatoroftheHHMI. 
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We describe here a method for drug target validation and identification of secondary drug tar- 
get effects based on genome-wide gene expression patterns. The method is demonstrated by 
several experiments, including treatment of yeast mutant strains defective tn calcineurin. in> 
munophilins or other genes with the immunosuppressants cyclosporin A or FK506 presence or 
absence of the characteristic drug 'signature' pattern of altered gene expression in drug.treated 
cells with a mutation in the gene encoding a putative target established whether that target was 
required to generate the drug signature. Drug dependent effects were seen in Wgetless' cells 
showing that FK506 affects additional pathways independent of calcineurin and the im-' 
munophihns. The described method permits the direct confirmation of drug target* and recog- 
nition of drug-dependent changes in gene expression that are modulated through pathways 
distinct from the drug's intended target Such a method may prove useful in improving the JfC 
ciency of drug development programs. / ■ % 



1 * 

Good drugs are potent and specific; that is. they must have 
strong effects on a specific biological pathway and minimal ef- 
fects on all other pathways. Confirmation that a compound in- 
hibits the intended target (drug target validation) and the 
identification of undesirable secondary effects are among the 
main challenges in developing new drugs. Comprehensive 
methods that enable researchers to determine which genes or 
activities are affected by a given drug might improve the effi- 
ciency of the drug discovery process by quickly identifying po- 
tential protein targets, or by accelerating the identification of 
compounds likely to be toxic. DNA microarray technology, 
which permits simultaneous measurement of the expression 
levels of thousands of genes, provides a comprehensive frame- 
work to determine how a compound affects cellular metabolism 
and regulation on a genomic scale 111 . DNA microarrays that 
contain essentially every open reading frame (ORF) in the 
Saccharomyces cerevisiae genome have already been used success- 
fully to explore the changes In gene expression that accompany 
large changes in cellular metabolism or cell cycle progression' 10 . 

In the modern drug discovery paradigm, which typically be- 
gins with the selection of a single molecular target, the ideal in- 
hibitory drug is one that inhibits a single gene product so 
completely and so specifically that it is as if the gene product 
were absent. Treating cells with such a drug should induce 
changes in gene expression very similar to those resulting from 
deleting the gene encoding the drug's target. Here we have com- 
pared the genome- wide effects on gene expression that result 
from deletions of various genes in the budding yeast 5. cerevisiae 
to the effects on gene expression that result from treatment 
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with known inhibitors of those gene .products. Using the cal- 
cineurin signaling pathway as a model system, we tested an ap- 
proach that permits identification of genes that encode proteins 
specifically involved in pathways affected by a drug. The FK506 
characteristic pattern, or signature*, of altered gene expression 
was not observed in mutant cells lacking proteins inhibited by 
FK506 (for example, a calcineurin or FK506-binding-protein 
mutant strain), but was observed in mutants deleted for genes 
in pathways unrelated to FK506 action (for example, a cy- 
clophilin mutant strain). Conversely, the cyclosporin A (CsA) 
signature was not observed in CsA-treated calcineurin or cy- 
clophilin mutant strains, but was seen in an FK506-binding-pro- 
tein mutant strain treated with CsA. The method also 
demonstrates that FK506. a clinically used immunosuppressant, 
has off-target* effects that are independent of its binding to im- 
munophilins. Thus, the approach we describe may provide a 
way to identify the pathways altered by a drug and to detect 
drug effects mediated through unintended targets. 

Null mutants phenocopy drug-treated cells on a genomic scale 
To test whether a null mutation in a drug target serves as a 
model of an ideal inhibitory drug, we examined the effects on 
gene expression associated with pharmacological or genetic in- 
hibition of calcineurin function. Calcineurin is a highly con- 
served calcium- and calmodulin-activated serine/threonine 
protein phosphatase implicated in diverse processes dependent 
on calcium signaling" ,3 . In budding yeast, calcineurin is re- 
quired for intracellular ion homeostasis' \ for adaptation to pro- 
longed mating pheromone treatment' 4 and in the regulation of 
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****** v**?**™ ^^e*«eurin signal^ jMttiv^niediatrt 
* «506 and cydnpark. A (CsA). C**eurln activity feT SnpSd^aT 
We subunlt (caldneurln A. encoded hi yeast by the CAM} and CNA2qenes) 
J^^^Wndlng regulatory suburft, ealmoduto(CMO)arK>caleinewhB 
rcnW^emering eefc. FK506 am. CsA spedtoBy bind arid inhibit the 
P^tfyH«*ne isomerast l aetMiy of their, .respect immunbp^ 
9 n *9J™«™ (FKBP) and IcydpphSm (Cylo/The^Smfe 
ST^!^!!" 1 ffp ^ ^ « thougmtomeciiattcatt^in. 
NbWca OrugHmmunophiin complexes bind and inhfctt the calcium, and" 

pneunn are vamcnptjonal activators that 
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the onset of mitosis", Jn mammals, caldneurin has been impli- 
cated in T-cell actlvaOon"; In apoptosls". incardiac hypertro- 
phy"and In the transition from shbrMerm to long-term 
memory". In both organisms, caldneurin activity is inhibited 
by FK506 and CsA. immunosuppressant drugs whose effects on 
caldneurln are mediated through families of intracellular recen- 
tpr proteins caUed immunpphilins'"» (Fig. j). To asses, the e£ 
fens of pharmacplogic inhibition of caldneurin. wild'type S. 
eereKfaiae was, grown : to. early logarithmic phase in the presence 
or absence of FK506 or CsA. Isogenic cells, from which the 

rvf* H C ?!! n8 * Ubun,tS of -'""""rtn (CAM Jand 

CNAZ) had been deleted" (referred to as the cm or caldneurin 
mutant), were grown in parallel, in the absence of the dnw 
Fluorescently-labeled cDNA was prepared by reverse trarucrS' 
tion of poly* RNA in the presence of Cy3- or CyS-deoxynu- 




unrelated pathways, and none.had statistically sfcnificant cm- 
relauons These data ^establish, that genedc /^pdon SSL 
cmeunn function provides* dose and specific' pher^cbovTf 
treatment with FK506 or CiA. - '>.n£^rT!!!%Rf* 
Jo avoid generalizing ftbm a single 'example: wV'also cornJ 

S« It! l?H°? ea r m orwl,d -«yP« cells with 3-aminotri. 
«o* (3-AT) with the effect, .of. deletion of the HIS3 gene HIS3 
encodes unidazoleglycerp, , phosphate dehydrause. wh£h^ 

^S7^ «H he h,SUd,ne ■*■*»*■* pathwayTh 



cleotide triphosphates and 'then hybridized io7SS£ ^^T *** ^ne bic^^ ^ y Th 
containing more than 6.000 DNA probes represent™ 97% of £TT'. com Petitive inhibitor of this enzyme that tri* 

2L £=. «E"g- _?"* W yeast lenomr 



the known or predicted ORF, m .the yeast genome 
Simultaneous hybridization of Cy5-labeled cDNA.from modo 
treated cells and Cy3-labeied cDNA from cells treated with 1 
ug/ml FK506 allowed the effect of drug treatment on mRNA lev- 
els of each ORF to be determined (Fig. 2a and b and data not 
shown) Similarly, effects of the caldneurin mutations on the 
mRNA levels of each, gene were assessed by simultaneous hy 

^^ ti °^I ) ' Cy5 * tobe,ed CDNA fTOm wUd t yP e "lis and Cy3- 
labeled cDNA from the caldneurin mutant strain (Fig 2d For 
each comparison of this kind, reported expression ratios are the 
average of at least two hybridizations in which the Cy3 and Cv5 
fluors were reversed to remove biases that may be introduced bv 
gene-spedfic differences in incorporation of the two fluors 
(data not shown). 

Treatment with FK506 in these growth conditions resulted in 
a Signature pattern of altered gene expression in which mRNA 
evels of 36 ORFs changed by more than rwSS 
(http://www.rosetta.org). A very similar pattern of altered sene 
expression was observed when the caldneurin mutant strain 
was compared to wild-type cells. Comparison of the changes in 
mRNA expression of each gene resulting from treatment of 
wi d-type ceUs with FK506 with mRNA expression changes re- 
suiting from deletion of the caldneurin genes showed the con- 
siderable similarity of the global transcript alterations in 
response^ the two perturbations (Fig. 26-d). Quantification of 
this similarity using the correlation coeffident (p) showed 
large correlations between the FK506 treatment signature and 
the caldneurin deletion signature (p - 0.75 * 0.03) as well as 
the CsA treatment signature (p - 0.94*0.02). but not with a 
randomly selected deletion mutant strain (deleted for the 
YER071C gene: p - -0.07 * 0.04; Fig. 2e). The FK506 treatment 
signature was also compared with those of more than 40 other 
deletion mutant strains or drug-treatments thought to affect 



Microarray- analysis' of ^iSff, 
= demonstrated the expicS large genSwS^^ 
«onal responses (involving j more, than ;i;000 ORFs) VeSj 
from treatment with 3-AT (Fig. 3a) or from H1S3 deletWffJ 

andrh" i!l , 6 COmP?riS ° n ' 0f ?-AT treatment signature 
Z 076 ^^^""^'howeda high level of conflation 
s^n It ? ^ eXte " ded to that experienced 
mall changes in expression level (Fig: 3b): As a negatlvS contJoT 
U« correlations between the 3-AT treatment signature 0 ^ e * 
>*3 mutant signature and the caldneurin mutant striin we* 
not statically significant (, . 0.09 * 0.06 and -0 01 ^ 0 04 
pectively). That both the caldneurin/FK506 and the 
compansons were highly correlated indicates that in many cases 
^T° n Pr ° n,e re$Ukin8 frem 8 ^ -eleuon dosiy^ 

inTih 1 t k PKUi0n Pr0flle ° f Wi,d - t yP e cdb ««««« witJ Z 

inhibitor of that gene s product. ^ 
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BecT^ ^ 9 lar96t V8,idaU ° n With de,etio " 
Because pharmacological inhibition of different targets minht 

g vesimilar or identical expression profiles, slmp.e companion 

of drug signatures to mutant signatures is unlikely to unambigu 

ously identify a drug s target. To overcome this hmltatton * „ 

additional decoder' step is used. We first compare the exoreT 

mn profile of wild-type drug-treated cell, to theVpress,^ 

file, from a panel of genetic mutant , train,. u,ing I corrdaSo^ 

coefficient metric. Mutant strains who* expression pTS u 

IubiettVd° t h ^ drU «- treated wlld «yPe ce.U are selected an 5 
ubjected to drug treatment, generating the drug signature in 
the mutant strain (that is. the mutant drug signature) Tu2 

toZ tr' enCOdCS 8 Pf0tein ' nVO,Ved ln 8 P«hway affic Jd 
by the drug, we expect the drug signature In mutant cells to be 

st^T m abSem - f ° r a " idea ' dru « ) from d ~8 signature 
seen in wild-type cells. •»«B"«urc 
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Rg.'* txpnssion profile* from ; 
FK506-lre««d wiWrtype <wt) ; , f . ■■ 
celts and a calcineurin-disruptlon ° 
mutant strain share a genome*, 
wide correction: ON A microarray 
analysis sfcovmg changes in gene 
expression resulting from FK506 
treatment (a and b) >or. from ge- 
netic disruption of genes encod- 
ing calcineurin s. Pseudo- . 
color image of . the results of «si- - 
multaneous hybridization of Cy5- r 
labeled ' .cONA (red) from. v 
mock-treated strain R563 and • Cy3-tebeled cONA 
(green), from strain R563 treated with 1 ug/rnl FK506 
ft. Enlarged view of the boxed area in a. Arrowheads in- 
dicate specific ORFs induced or repressed, a. Pseudo- 
color image of the results of simultaneous hybridization ^ 
of Cy5-labeled cONA (red) from strain RS63 and.Cy3- 
tabeled cDNA (green), from strain MCY300 (deleted for 
the CNA1.CNA2 catalytic subunits of calcineurin). 
Arrows indicate specific ORFs induced or repressed, d 
The log w of the expression ratio for each, ORF derived 
from the FK506 treatment hybridizations is plotted ver- 
sus the tog«e of the expression ratio in the calcineurin 
mutant hybridizations. ORFs that were induced or re- 
pressed in both experiments are shown as green and 
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( To illustrate this, we treated the his3 mutant strain with 3- 
AT. The signature pattern of altered gene expression resulting 
from treatment of the mutant strain with 3-AT was much less 
complex than that of the 3-AT signature in wild-type cells (Fig. 
4). This is seen simply by examining plots of mean intensity of 
the hybridization signal (which approximately reflects level of 
expression) versus the expression ratio for each ORF (Fig. 4). 
Genes that were expressed at higher or lower levels in 3-AT 
treated cells or in hJs3 mutant cellsare shown as red and green 
dots, respectively. We analyzed the 3-AT signature in wild-type 
(Fig. 4a) and his3 mutant cells (Fig. 4c). as well as the his3 mu- 
tant strain signature (Fig. 4b). Whereas histidine limitation in- 
duced by 3-AT induced more than 1.000 transcription-level 
changes in the wild-type strain, few or no transcript level 
changes were induced by treatment of the /i/s3-deletion strain 
with 3-AT. This indicates that with the growth conditions used, 
essentially all of the effects of 3-AT depend on or are mediated 
through the HIS3 gene product. 

Applying this approach to the calcineurin signaling pathway 
showed the specificity of the method. The calcineurin mutant 
strain and strains with deletions in the genes encoding the 
most abundant immunophilins in yeast 1 * (CPHJ and FPRl) 
were treated with either FK506 or CsA to determine the profiles 

Table 1 Signature correlation of expression ratios as a result of FK506 
treatment in various mutant strains 



wild-type 
4/-FK506 



enafprl 
♦/-FK506 



— — ^ — — — ■ V. r J. 

Signature concution Hums iht iDsence of the FKS06 wmiiur. uwth^.ik. - .... • . ' 
(major FKS06 binding protein) Meton muunu. cwr^nttt^l^H T 8n0 *" 

unit, of Caneurm. OVUl and CNA2. The correUt™£5S £££ in JTrTT" ,Ut " 
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of altered gem expression resulting from drug treatment of the 
mutant cells (that is. mutant */. drug). We compared the drug 
signatures in the mutants to the wild-typedrug signature using 
the correlation coefficient metric (Table'lf. Although the signa- 
ture generated by treatment of wild-type cells with FK506 was 
highly correlated to the calcineurin mutant strain signature (p 
- 0.75 ± 0.03). it bore no similarity to the profile after treat- 
mem or the calcineurin mutant strain with FK506 (p - -0 01 ± 
0.07). This indicates that FK506 was unable to elicit its normal 
transcriptional response in the calcineurin mutant strain 
Likewise, treatment of the fprj mutant strain with FK506 
elicited an expression profile that was not correlated to the 
FK506 signature in the wild-type strain (p - -0.23 ± 0.07) Indi- 
cating that the FPRl gene product is likely to be involved in the 
pathway affected by FK506. The same was true for the cna fori 

w^re?™"' ln COntra "' trea,ment ° f the c P h ' muta "t «"»n 
with FK506 generated an expression profile highly correlated 

with the wild-type FK506 expression profile (p - 0 79 ± 0 03) 
indicating the cphl mutation did not block the mode of action 
of FK506 and thus is not directly involved in the pathway af- 
fected by FK506. We tabulated the change in expression in re- 
sponse to FK506 in different mutant strains for all ORFs with 
expression ratios greater than 1.8 in FK506-treated cells or in 

the calcineurin mutant strain (Fig. 5 a) .The 
calcineurin mutant strain signature and the 
FK506 responses in wild-type and the cphl 
mutant strain are similar, and there are no 
transcript-level changes (seen in black) for 
treatment of the calcineurin. /pr7 and cna 
fprl mutant strains with FK506 (Fig. 5 a ). 

Similar experiments and analyses with CsA 
provided further validation of this approach. 
The expression profile elicited by treatment 
of wild-type cells with CsA was highly corre- 
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f,0f P*A AWimitant'- suain 
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$h0, W9 changes : .tr> t.gene 
expression resulting from 3- 
A7 trea tment ( s) or from ge- 
netfc disruption of the H/SJ 
gene.' ' (c) . ^e,.' ' teudo-colbr 
image of the resufts of simul- 
taneous hybridaeUon of . „. 

CyS-labeled cDNA (red) from mock-ueeted wild-type strain R4 91 and 
Cy3-labeted cDNA (green) from strain R491 ueated with 10 mM 3-AT 
b. Plot of the log* of the expression ratio for each ORF derived from the 
3-AT treatment hybridizations is plotted versus the tog* of the expression 
ratio inihe Ns3 mutant hybridizations. ORFs that were induced or re- 
pressed in both experiments are shown as green and red dots respec- 
tively. The correlation of expression ratios applies not only to genes with 
large expression ratios (for example. CHA 7 and ARG 7). but also extends to 
genes with expression ratios tess than 2 (for example. ILV1 and CPH1) 
li VI is induced 1.9-fold and 1 .5-fold, and CPH1 is downregulated 1 9-fold 
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lated to the profile elicited by mutation of the calcineurin genes 
(P - 0.71 ± 0.04). but did not correlate with the expression pro- 
file resulting from treatment of the calcineurin mutant strain 
with CsA (p . -0.05 * 0.07; Table 2). indicating that the genetic 
deletion of calcineurin interfered with the ability of CsA to 
elicit its normal transcriptional response. Likewise, the CsA sig- 
nature was essentially absent in CsA-treated cphl mutant cells 
and the expression profile of CsA-treated cphl mutant cells cor- 
related poorly to that of CsA-treated wild-type cells (p « 0.18 ± 
0.07). Thus, the CPHl gene product was required for the CsA re- 
sponse seen in wild-type cells. Conversely, treatment of fprl 
mutant cells with CsA resulted in an expression pattern very 
similar to the profile of CsA-treated wild-type cells (p - 0 77 ± 
0.03). indicating that FPX1 was not necessary for the CsA-medi- 
ated effects. Analysis of individual ORFs affected by CsA and 
their expression ratios over the entire set of experiments con- 
firmed that CPH1 and the genes encoding calcineurin. but not 
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FPJtl. are necessary for the wild-type CsA response (Fig. 56). The 
observation that the profiles resulting from FK506 or CsA drus 
treatment are similar to that of the calcineurin deletion mutant 
strain might allow the prediction that calcineurin was involved 
in the pathway affected by these drugs. But because the expro- 
won profile of the fprl mutant strain did not bear a strong simi- 
larity to the wild-type drug expression profile for FK506. it is 
obv,ous that the drug treatment of the mutant strains was nee- 
essary to identify Fprl . but not Cphl . as a potential FK506 drug 
target In the same way. the decoder" strategy was necessary to 
identify Cphl. but not Fprl. as a potential drug target for CsA. 

Decoder' approach can identify secondary drug effects 
For a drug that has a single biochemical target, the strategy out- 
lined above may be useful in target validation. In manycases 
however, a compound may affect multiple pathways and elicit 
a very complex signature. Decoding 1 such a complex signature 
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Fig. 4 Treatment of the his3 mutant suain with 3-AT shows nearly com- 
plete loss of 3-AT signature. A plot of the log* of the mean intensity of hy. 
br.di23t.on for each ORF versus the log* of its expression ratio for each 
experiment is shown next to a pseudo-color image of a representative 
portion of the microerray. ORFs that are induced or repressed at the 95% 
confidence level are shown in green and red, respectively. «. Expression 
profile from treatment of the wild-type (wt) strain with 3-AT. CyS-labeled 
cDNA (red) from mock-treated strain R491 and Cy3-labeled cDNA 
(green) from suain R491 treated with 10 mM 3-AT. 6, Expression profile 
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Log* (intensity) 



from th, h B 3 deletion strain. CyS-labeled cDNA (red) from suain R491 
and Cy3.labeled cDNA (green) from swain R1226. deleted for the HIS3 
* J x P ression P'°Ne of treatment of the r*3 deletion strain with 3- 
AT. CyS-labeled cDNA (red) from /i«3.deleted suain R!226 and CvS-la- 
beled cDNA (green) from strain R1226 treated with 10 mM 3.AT 
Arrowheads indicate the DNA probe and data point corresponding to the 
MS3 gene. The blue dashed line represents the threshold below which er- 
rors tend to increase rapidly because spot intensities are not sufficiently 
above background intensity. »»»a«nuy 
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Table 2 Signature correlation of expression ratios as a result of CsA 
treatment in various mutant strains 



wOd-type 
♦/-CsA 



wild-type 
♦/-CsA 



cna 
♦/-CsA 



fpri 
♦/-CsA 



cnacphl 
♦/-CsA 



cphl 
♦/-CsA 



0.94 i 0.04 -0,05 t .07 0.77 ± 0.03 



n *0 07 0.18 ±0.07 



s 

w 
c 

| 

O 

E 



6 

I 

a) 
E 
< 

£ 

« 
Z 

CD 



signature correlation shows the absence of the CiA t«w«n^ i *. 

(cycJophUin) deletion mounts, cne represents t^rn^^L^ * ,hB («a) end cpnl 

c«~nr, CAM, end CNA2. The con^^^^l^ of the catalytic subunft, of a,. 

betwtCfll ^P^ of hybridizations from tndependenTiud lv£ / ?l C ° U,mn «W«s the correlation 

*yP* CsA experiments. 



nto the effects mediated through the intended target (the on- 
target signature-) and those mediated through unimended «r 
gets (the -ofT-target- signature) might be use*! in 
compound's specificity. Our decoder" strategy is based onX! 
premise that off-targef signature should tataZL To Z 
genetic disruption of the primary target nsenslt,ve ,0 the 
To determine whether the decoder' approach could identify 
an off-targef profile, we looked for a Lg-resp 0nsj ve S 
whose expression is insensitive ,o deletion of the primary ar 
get. To increase the likelihood of observing such ge"e7 he 
same strain, described in Tables . and 2 were treated ^ 
higher concentrations (50 ug/ml) of FK506. This led to a much 

^JSZSF"" Pr< ! n,e * wa *W «*. indicating 
vain. 25i* ? COneen ' ratl0n - 0(506 was in W°iting or acti 

rSS „d 7 tar8eU - SeVCraI ° f the 0RFs in this expanded 
FK506-induced expression profile were not affected by the cal 

tant strains did not block their presence in the FK506 
expression signature (Fig. 6). This indicates that FK5o7was trie 

w^Th 8e , 5 ,n tra " S ? ipt ,eV6lS ° f Senes through pat* 
ways independent of calcineurin. CPHl and FPRl Manv of ,h- 

unregulated ORFs in the off-targef pathway we« g T« re 
ported to be regulated by the transcriptional activafor Gcn 4 
(ref. 24). In some strains, a reporter gene under GCN4 control 
was induced in response to FKS06 treatment" To * 
whether CCN<* involved in this V^^H^Z 
of calcineurin. CPHl and FPR 1. we analyzed the effects of treaT 
ment with high-dose FK506 on global Rene exlrZinl? 
strain with a CCN4 deletion (Fig. 1,. 5 ffj , "g™*^ ^ 
aneurin-independen. expression ratios greater thaT/az w ere 

^lEE^SSE mU H am ' r C8Ung ,h8t ^eirlndLTon 
oy FK506 was CCAW-dependent. Not all CCAM-regulated eene* 
were induced by FK506. This FK50$-lnduced subse t of 

fTcSffT ^ "V"" ^ <° e hangi 

in Gcn4 level,, or perhaps other regulatory circuit, prevem 

FK506 activation of ,ome COVV-regulated genes, iven 0 f7h e 

remaining nine ORFs mduced by FK506 wire independent of 



both the caldneurtn and CCN4 pathway,. The 
simplest explanation is that FK506 inhibits or 
activates additional pathways. Members of th* 

t7 T*^ 5 " 02 and ** « en « that en! 
code drug efflux pumps with structural homol- 
ogy to mammalian mulUple drug resistance 

Pdr5 to mhiblt its function". Our results indi- 
cate that treatment with FK506 leads to four- 
fold-to-sixfold induction of PDRSmRNA levels 
YORl. another gene that can confer drug res*." 

airs: - » "ssrr 

procesjes 'directly or Indirectly affected by the dr^KZ. " 
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Fig. 5 Response of FK506 and CsA sionatt*p n*~~ ^ ~„ 

response to Vestment with 1 M9 /ml FK506 (a) « SO^To^i f« J 8 

green (.nOuction^ed (repression) color scale. » CaldnT^ ^ 
end FKS06 ueaunem sigr-u,. oenes m h £ 
FKS06 signature genes have expression ratios near iimtv hwuJ. 
involved in pathwy, affened by FK506 (ca^h^ 7 T^T r'"* 

icna) mutant and CsA treatment signaturV oenT«Tta , 1 l" r fc,ne,jr,n 

columm.AlnK)st.UCsA^mreger^ve ex^^,.L ^ ,W0 
deietion «r tins invo^ ^yT^ 0^1^, ^^ h 

^ ^ ' ™» n *> but no, in option in Le^Z^T 
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no similarity to the wild-type drug expression profile In rn„ 
trast. drug-mediated signatures from stains vStE TuuUomTn' 
genes involved in pathways unrelated t0 ^g^on 
showed extensive similarity to the wild-type drug s^ lu " fiv 

SS? th * approach lo a dru8 *" affecu 

(FK506). we were able to decode a complex signature imo coT 
ponem parts, including the identification of an 

2JL2 11 W3i . '? edl8,ed ,hr0U8h """^ '"depeXm 0 
calcineurinortheFprl immunophilin. 

Discussion 

It is well-established that high-throughput biochemical screen 

specificity: the tendency of , ^dT^' 1 ^ 
other than that of its intended target The ahmtv ^ 2 * 

So c Lr ar8et ' en r *» ,,ke,y * 

Profiling compounds with known toxicities will allow the ,£ 
velopment of a database of expression changes associated wfth 
particular toxicities. RecogniUon of potential t£5ta£ the 
off-targef signatures of otherwise promisinp com™. * 1 

™?»«owe,r,,e,iden^^^ 

«tab. Comparing the extent and peculiarities 0 f off-u rge ! 
natures of promising drug candiates could provide nf w way 
to group compound, by their effects on secondary pathways 
even before those effects are understood. This may^ov^oTe 
an alternative, potentially more effective, way to select «m 
pounds for animal and clinical trials. Some drL are mo«Tr" 
fective against a related protein than agains^he or7Z..v 
intended target. Sildenafil <Viagra~). f or LZ^Snt 
developed as a phosphodiesterase inhibitor to conTS «m ^ 
contractility, but was found to be highly soec?fir fn, 1 u 
diesterase 5. an isozyme whose inhibftio'n £g£ 
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fin. 6 Response of FK506 sionitir* n*n»« k «~ 
« different genes. Genes ^^^2!^ 
of 4 in at least one experiment mil^TH^S^^ " mXot 
the indicated strainT^!^ • nd ?*«Won«iosfc, 

*ton) color scaT^c^hl^f^ QnMn >^ <*P~- 
/ ~> w. ine genes have been divided into dans* rru._ 

«Pond«g to these expected behevic^O^eMrS? ^ 
"W7 or both ere deleted: •CC«4^^r~l-^^^ Bene * * 

COM^ndtpandM- ger^/es^Sd^fn? ' "JfH ^ * ,d 

tested. A complex behav^'^! KS06 a " del " iofl 

Lumpwx oenavior class o provided tor those oeme* ih.t 

not match the model of FK506 taoo^Z^Zi^ " 
eineurinwfprl w *p a ,,tehyOv«i^. mrt " ttd 



penile erection. It is possible that application of the de 

strategy will be the efficient selection decoder 1 
n„~, i se, ecuon of the mutants with del*. 

Uon, n genes most likely to encode the intended d™« ur.« 
The signature correlation plots described -™ - 8 * 
could be used as par, of that selection ^ I ' metflC that 
to be explored. Applying Z dicodl? T ° thm need 

sents additional "S^ u ?**"' ™ mmali * n ce »* P«- 
isolate functional* ^'^^ZZSZSS 
able promoters, known specific inhibitors Ani££?X2 h 
bozymes. and methods of tareetinp ^.n. 
degradation are possibie and teTted ^oXm? 

tion is that not all cell tvoes e*„r«.« Anoth er Hmita- 

therefore ofU^Ji^ ^ 2X~£ £ £f"- «- 
type,, n addition. appl y jn g , he y d J?™^ ?™ wl 
abo require technical improvements that allow exprLion B - 
Ming from a small number of cells Even th.Z^ J P 
of whether the insensitivity of "ff ta 'e" s^ T" 0 " 
ruption of the main target is the LcJ^l Sig ™ wes 10 the «lis- 
be an by tnfi JS^^Z^^ 

Leibier. however, have argued in favor of r«K. ? / nd 
cal networks, indicating 8 , ha, J" t T^SST^tT^ 
signatures) may be robust even when ,h» ! ( off - ,ar 8« 

another perturLion ^TZ^^S^SSS^ 
Many practical developments will be necessar^ff hV decodf ^ 
concept is to be broadly applied. W 
Expression arrays have been used mainly as an initial screen 
or genes induced in a particular tissue or process orS»S£ 

nd LT::"- thi " eff0rt ,0 refine «P-""ental p^tocoh 
and repeat expenments increases the reliability of the daiT»nd 
permus new applications. For example, it proJ,lX ef £ 
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Table 3 Yeast strains used 



Strain 

YPH499 

R563 

R558 

R567 

MCY300 

R132 

R133 

R559 

BY4719 

BY4738 

R491 

BY4728 

BY4729 

R1226 



Relevant genotype 

Mats ura3-S2 Ip2-801 ade2-101 trpl 
Mats ura3 52 tys2-80l ade2-101 trpl 
Mats ura3-$2 Iys2-801 *o*P-707 trpl 
Mats ura3>S2 lys2-80l ade2-W1 trpl 
Mats ura3-52 ty*2-801 sde2-101 trpl' 
Mats urs3-52tyi2-801 sde2-101 trpl 
Mats ura3 52 Iys2-601 ade2*W1 trpl 
Mats ura3>52 Iys2-801 sde2-101 trpl 
Mats trpl -463 ura3-A0 
Mata trpUA 63 ura3-40 
Mats/a BY4719 X6Y4738 
Mats tts3>A200 trpl -463 ura3-A0 
Mata his3-4 200 trph463 ura3-40 
Mats/a BY472B XBY4729 



-463 Ns3-4200 leu2-4 1 
-463 Ns3-6200 leu2-4 1 h&3::HiS3 
-463 ha3-4200 ku2-4 7 fpr1::HtS3 
-463 h*3-4200 leu2-4 1 cph1::HIS3 
-463 ha3-4200 ieu2-4 1 aia 14 IrMcG ena24 1::H1S3 
-463 h*3-A200 Ieu2-A 7 cna 14 7;;/wG cna2A 1::HIS3 cph1::karf 
-463 ha3-4200 leu2-4 7 cna 14 1::hisG cna24 1::HIS3 fprl::karf 
-463 ht*3.A200leu2-4l hi$3::H!$3 gcn4::IIU2 



Reference 
(34) 

(this study) 
(this study) 
(this study) 
(21) 

(this study) 
(this study) 
(this study) 
(35) 
(35) 

(this study) 

(35) 

(35) 

(this study) 



of genes at higher confidence levels that serve as a more 
unique signature for a given protein perturbation. In addition, 
it allows subtle signatures to be detected, when, for example, a 
protein is only partially inhibited. This may enable clinical 
monitoring of small changes in protein function in disease or 
toxicity states before they could otherwise be detected. 
Because the functions of many genes detected on transcript ar- 
rays are known, these microarrays are powerful tools that pro- 
vide detailed information about a cells physiology. For 
example, changes in the flux through a metabolic pathway are 
reflected in transcriptional changes in genes in the pathway 7 . 
Furthermore, it may be possible to indirectly measure protein 
activity levels from expression profiling data (S.F.. at a/., un- 
published data). Thus, although the eventual development of 
genomic methods allowing the direct measurement of all cel- 
lular protein levels will be an important achievement, tran- 
script array technology offers an immediate and robust means 
of evaluating the effects of various treatments on gene expres- 
sion and protein function. 

Methods 

Construction, growth and drug treatment of yeast strains. The strains 
used in this study (Table 3) were constructed by standard techniques". 
To construct strain R559, strain R563 was transformed to Leu* with plas- 
mid pM12 digested by Sah and MM (provided by A. Hinnebusch and T. 
Dever). Strains R132 and R133 were constructed by transforming the bac- 
terial kanamycin resistance cassette* flanked by genomic DNA from the 
CPH1 and fPRI loci, respectively, and selecting for G4l8-resistant 
colonies. For experiments with FK506, cells were grown for three genera- 
tions to a density of 1 x 10' cells/ml in YAPD medium (YPD plus 0.004% 
adenine) supplemented with 10 mM calcium chloride as described' 1 . 
Where indicated. FK506 was added to a final concentration of 1 ug/mi 
0.5 h after inoculation of the culture or to 50 pg/mJ 1 h before cells were 
collected. CsA was used at a final concentration of 50 ug/ml. Cells were 
broken by standard procedures" with the following modifications: Cell 
pellets were resuspended in breaking buffer (0.2 M Tris HCl pH 7.6. 0.5 M 
NaCI. 10 mM EDTA. 1% SDS). vonexed for 2 min on a VWR multi-tube 
vortexer at setting 8 in the presence of 60% glass beads (425-600 urn 
mesh; Sigma) and phenolxhloroform (50:50. volume/volume). After sep- 
aration of the phases, the aqueous phase was re-extracted and ethanol- 
precipitated. Poly A* RNA was isolated by two sequential 
chromatographic purifications over oligo dT cellulose (New England 
Biolabs. Beverly. Massachusetts) using established protocols". 

For experiments using 3-AT. wild-type or his3/his3 cells were grown to 
early logarithmic phase in SC medium, pelleted and resuspended in SC 
medium lacking histidine for 1 hr in the presence or absence of 1 0 mM 3- 
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AT. as indicated. Cells were harvested and mRNA isolated as above 
FK506 was obtained from the Swedish Hospital Pharmacy (Seattle 
Washington) and purified to homogeneity by ethyl acetate extraction by 
I Simon (Fred Hutchinson Cancer Research Center. Seattle. Washington) 
CsA was obtained from Alexis Biochemicals (San Diego. California)- 3-AT 
was from Sigma. 

Preparation and hybridization of the labeled sample. Fluorescentry-la- 

? A W8S prepared * P urified and hybridized essentially as be- 
scribed . Cy3- or CyS-duTP (Amersham) was incorporated into cDNA 
during reverse transcription (Superscript || ; Life Technologies) and puri. 
fied by concentrating to less than 10 ul using Microcon-30 microconcen. 
trators (Amicon. Houston. Texas). Paired cDNAs were resuspended In 
20-26 Ml hybridization solution (3 x SSC. 0.75 M g/ml polyA DNA. 0.2% 
5D5) and applied to the microarrey under a 22. x 30-mm coverslip for 6 
n at 63 *C. all according to a published method'. 

Fabrication and scanning of microarrays. PC R products containing 
common 5 and 3 sequences (Research Genetics. Huntsvilte. Alabama) 
were used as templates with amino-modified forward primer and unmod- 
ified reverse primers to PCR amplify 6.065 ORFs from the S. cerewstae 
genome. Our first-pass success rate was 94%. Amplification reactions that 
gave products of unexpected sizes were excluded from subsequent anely. 
sis ORFs that could not be amplified from purchased templates were am- 
plified from genomic DNA. DNA samples from 100- M I reactions were 
■sopropanol-precipiiated. resuspended in water, brought to a final con- 
centration of 3x SSC in a total volume of 15 ul. and transferred to 384- 
wen microliter plates (Genetix Limited. Christchurch. Dorset. England) 
PCR products were spotted onto 1 x 3-inch polylysine-treated glass slides 
by a robot built essentially according to defined specifications"' 
(http://cmgm.stanford.edu/pbrown/MGuide). After being printed slides 
were processed according to published protocols'. 

Microarrays were imaged on a prototype multi-frame CCD camera in 
development at Applied Precision (Issaquah. Washington), each CCD 
image frame was approximately 2-mm square. Exposure times of 2 s in 
the CyS channel (white light through Chroma 61 8-648 nm excitation fil- 
ter. Chroma 657-727 nm emission filter) and 1 s in the Cy3 channel 
(Chroma 535-560 nm excitation filter. Chroma 570-620 nm emission fU. 
ter) were done consecutively in each frame before moving to the next 
spatially contiguous frame. Color isolation between the Cy3 and Cy5 
channels was about 100:1 or better. Frames were 'knitted' together in 
software to make the complete images. The intensity of spots (about 100 
urn) were quantified from the 10-um pixels by frame-by.frame back- 
ground subtraction and intensity averaging in each channel. Dynamic 
range of the resulting spot intensities was typically a ratio of 1 000 be- 
tween the brightest spots and the background-subtracted additive error 
level. Normalization between the channels was accomplished by normal, 
izing each channel to the mean intensities of all genes. This procedure is 
nearly equivalent to normalization between channels using the intensity 



1299 



ARTICLES 



fifi 1998 Nature America Inc. . httpV/medTdne Jiature 



com 




i 



c 

i 

I 

n 



oo 

OS 



ratio of genomic ONA spots', but is possibly more robust, as it is based on 
the intensities of several thousand spots distributed over the array. 

Signature correlation coefficients and their confidence limits. 
Correlation coefficients between the signature ORFs of various expert 
ments were calculated using: 



P-IxOfc/ffVly* 1 ) 
k k k 



where x. is the tog* of the expression ratio for the r gene in the x sign*, 
ture. and * h the log m of the expression ratio for the k* gene in the y sio- 
nature. The summation is over those genes that were either up- or 
down-regulated in either experiment at the 95% confidence level These 
genes each had a less than 5% chance of being actually unregulated (hav- 
ing expression ratios departing from unity due to measurement errors 
alone). This confidence level was assigned based on an error model which 
assigns a tog norma I probability distribution to each gene's expression 
ratio with characteristic width based on the observed scatter in its re- 
peated measurements (repeated arrays at the same nominal experimental 
conditions) and on the individual array hybridization quality. This latter 
dependence was derived from control experiments in which both Cv3 
and Cy5 samples were derived from the same RNA sample. For large 
numbers of repeated measurements the error reduces to the observed 
scatter. For a single measurement the error is based on the array Quality 
and the spot intensity. 3 
Random measurement errors in the x and y signatures tend to bias the 
correlation towards zero. In most experiments, most genes are not signif- 
icently affected but do show small random measurement errors Select™ 
only the '95% confidence' genes for the correlation calculation, rather 
than the entire genome, reduces this bias and makes the actual biolooical 
correlations more apparent. 

Conelations between a profile and itself are unity by definition. Error 
limits on the correlation are 95% confidence limits based on the individ. 
ual measurement error bars, and assuming uncorrected errors" They do 
not include the bias mentioned above: thus, a departure of p from unity 
does not necessarily mean that the underlying biological correlation is inv 
Z^F™"; * correlation of 0.7 ± 0.1. for example, is very signifi- 
cantly different from zero. Small (magnitude of p < 0.2) but formally 
significant correlation in the tables and text probably are due to small sys 
tematic biases in the Cy5/Cy3 ratios that violate the assumption of inde- 
pendent measurement errors used to generate the 95% confidence 
limits. Therefore, these small correlation values should be treated as not 
significant. A likely source of uncorrected systematic bias is the partially 
corrected scanner detector nonlinearrty that differently affects the Cv3 
and Cy5 detection channels. y 
The 1 ug/ml FK506 treatment signature was compared with more 
than 40 unrelated deletion mutant strain or drug signatures. These con- 
trol profiles had correlation coefficients with the FK506 profile that were 
d,stnbuted around zero (mean p . -0.03) with a standard deviation of 
0.16 (data not shown), and none had correlations greater than p . 0 38 
Similarly, the calcineurin mutant strain signature correlated well with the 
CsA treatment signature (p - 0.71 ± 0.04) but not with the signatures 
from the negative controls (mean p . -0.02 with a standard deviation of 
0. 1 8/. 



•roller spots have fewer image pixels in the average. TWs does not d*. 
grade accuracy noticeably until the number of pixe* fans betowtan h 
wh*h ease the spot i, rejected from the data J" *.nS-^^ 

regwns by the image processing software. Unequal spot -wander' wttNn 
• subregwn greater than half...,pot spacing is a ditneutty for the W 
mated quantitating algorithms: in this case, the spot is rejected f«« 

*ZT 00 hum8n of the 'wander^ spc^rS 

ove,iapp,ng are excluded from the data set. Less than l£ tfspoTivS 
eally are rejected for these reasons. 
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Quality controls. End-to-end checks on expression ratio measurement 
accuracy were provided by analyzing the variance in repeated hybridiia. 
Hons using the same mRNA labeled with both Cy3 and Cy5. and also 
using Cy3 and Cy5 mRNA samples isolated from independent cultures of 
the same nominal strain and conditions. Biases undetected with this pro- 
cedure. such as gene-specific biases presumably due to differential incor- 
poration of Cy3- and CyS-dUTP into cDNA. were minimized by doino 
hybridizations in fluor-reversed pairs, in which the Cy3/Cy5 labeling of 

k C '!!! 9 ^ COndltl0m W " reVCrSed in one "Pe'iment with respect to 
the other. The expression ratio for each gene is then the ratio of ratios be- 
tween the two experiments in the pair. Other biases are removed by aloo. 
nthmic numerical de-trending. The magnitude of these biases in The 
absence of de-trending and fiuor reversal is typically about 30% in the 
ratio, but may be as high as twofold for some ORFs. 
Expression ratios are based on mean intensities over each spot. Some 
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co mosaic vMINAwit obtained by phenol and 
chloroform extractions of the virus and predprtated 
from cthanoL CA-NC assembly reactions in the pres- 
ence of noncognate RNAs were identical to those 
f tven in (9). in the absence of UNA. CA-NC cones 
formed under the following conditions: 300 pM CA- 
NC. 1 M NaU and SO mM tris-HQ (pH &0) at 3TX. 
for 60 mln. In the absence of exogenous UNA, neither 
cones nor cylinders formed at concentrations of OS 
M Nad or below. Absorption spectra demonstrated 
that our CA-NC preparations were not contaminated 
with BsdmkNa coU RNA (estimated lower detection 
limft was -1 bast/protein molecule). To control for 
even tower teveb of ANA contamtnation, we prem- 
cubated the CA-NC protein with 0.5 mg/mt ribenu- 
dease A (Type LAS. 54 Kunitz U/mg, Sigma) for 1 
hour ** 4 *C which then formed cones normally 
13. V. y, KBshko. data not shown. 
H. M. Ce and K. SarrJer, Own. Phy$. Lett 220 192 
(1994). 

15. A. Krishnan er tt, Naturt 3M. 4S1 (1997). 

16. L B. Kong et at./ Vint 7Z 4403 (1998). 



17. Assembly mbttures were deposited on holey carbon 
gnds. plotted briefly with filter paper, plunged into 
»*Qu»drthane. and transferred to liquid nitrogen. Fro- 
wn grids were transferred to a Philips 420 TIM 
ec^ppe^wrmac^uncoldstafesyTtem.arKlima» 
of partides m vitreous ice were recorded under tow 
dose conditions at 36.000 x magnification and -1.6- 
defocus. 

18. J. T. Finch, data not shown. 

19. P, A. Crowther. Procetriings of tht Third John tones 
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ef at. Virology 196. 288 (1994); £ Bartlis. J. McOer- 

JSTiSSi 7177 (199fi,: L " *• mb °j- 

** 



25 



The Transcriptional Program in 
the Response of Human 
Fibroblasts to Serum 
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The response of mammalian fibroblasts to 
serum has been used as a model for studying 
growth control and cell cycle progression ( /). 
Normal human fibroblasts require growth 
factors for proliferation in culture; these 
growth factors are usually provided by fetal 

V. K ryer and D. T. Ross. Department of Biochemistry 
Stanford University School of Medicine. Stanford CA 
94305. USA. M, B. Eiscn and 0. Botsiein, Department 
of Genetics. Stanford University School of Medicine 
Stanford CA 94305. USA C Schuler and M. S. Boeuski' 
National Center for Biotechnology Information Be-' 
thesda MD 20894. USA. T. Moore and j. Hudson it 
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20892. USA. L M. Staudt. Metabolism Branch. Divi- 
sion of Clinical Sciences. National Cancer Institute 
Bethesda. MD 20892. USA. P. O. Brown, Department 
of Biochemistry and Howard Hughes Medical Insti* 
tute. Stanford University School of Medicine. Stanford 



bovine serum (FBS). In the absence of 
growth factors, fibroblasts enter a nondivid- 
mg state, termed G 0 , characterized by low 



Rg. 1. The same section of 
the microarray is shown 
for three independent hy- 
bwfaatJons comparing RNA 
isolated at the 8-hour time 
P°*nt after serum treat- 
ment to RNA from serum- 
deprived cells. Each mi- 
^oarray contained 9996 
eiernenLs, including 9804 
human cDNAs, represent- 
ing 8613 different genes. 
mRNA from serum-de- 
pnved ceils was used to 



metabolic activity. Addition of FBS or pun- 
fled growth factors induces proliferation of 
the fibroblasts; the changes in gene expres- 
sion that accompany this proliferative re- 
sponse have been the subject of many studies, 
and the responses of dozens of genes to se- 
rum have been characterized. 

We took a fresh look at the response of 
human fibroblasts to serum, using cDNA mi- 
croarrays representing about 8600 distinct hu- 
man genes to observe the temporal program of 
transection that underlies this response Pri- 
mary cultured fibroblasts from human neonatal 
foreskin were induced to enter a quiescent state 
by serum deprivation for 48 hours and then 
stimulated by addition of medium containing 
10% FBS (21 DNA microarray rrybrid3> 
was used to measure the temporal changes in 
mRNA levels of 8613 human genes (S) at 12 
times, ranging from 15 min to 2A hours after 
serum stimulation. The cONA made mom pu- 
rified mRNA from each sample was labeled 
with the fluorescent dye Cy5 and mixed with a 
common reference probe consisting of cDNA 
made from purified mRNA from the quiescent 
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culture (time zero) labeled with a second fluo- 
rescent dye, Cy3 {4). The color images of the 
hybridization results (Fig. 1) were made by 
representing the Cy3 fluorescent image as 
green and the Cy5 fluorescent image as red and 
merging the two color images. 

Diverse temporal profiles of gene expres- 
sion could be seen among the 8613 genes sut- 



Rg. 2. Ouster Image 
showing the different 
dasses of gene egres- 
sion profiles. Five hun- 
dred seventeen genes 
whose mRNA levels 
changed in response to 
serum stimulation were 
selected (7). This sub- 
set of genes was clus- 
tered hierarchically into 
groups on the basis of 
the similarity of their 
expression profiles by 
the procedure of Eisen 
et a/. (6"). The expres- 
sion pattern of each 
gene in this set is dis- 
played here as a hori- 
zontal strip. For each 
gene, the ratio of 
mRNA levels in fibro- 
blasts at the indicat- 
ed time after serum 
stimulation ("unsync" 
denotes exponentially 
growing cells) to its 
level in the serum-de- 
prived (time zero) fi- 
broblasts is represented 
by a color, according to 
the color scale at the 
bottom. The graphs 
show the average ex- 
pression profiles for the 
genes in the corre- 
sponding "duster" (in- 
dicated by the letters A 
to J and color coding). 
In every case examined, 
when a gene was rep- 
resented by more than 
one array element, the 
multiple representa- 
tions in this set were 
seen to have identical 
or very similar expres- 
sion profiles, and the 
profiles corresponding 
to these independent 
measuremen t s clus- 
tered either adjacent 
or very dose to each 
other, pointing to the 
robustness of the clus- 
tering algorithm in 
grouping genes with 
very similar patterns of 
expression 
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Reports 

vcyed in this experiment (Fig. 2); marry of these 
genes (about half) were unnamed expressed 
sequence tags (ESTs) (J). Although diverse 
patterns of expression were observed, the order- 
ry choreography of the expression program be- 
came apparent when the results were analyzed 
by a clustering and display method developed 
in our laboratory for analyzing genome-wide 
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gene expression data (6). An example of such 
an analysts, here applied to a subset of 517 
genes whose expans i on changed substantially 
in response to scram ( 7), is shown in Fig. 2. 
The entire detailed data set underlying Fig. 
2 is available as a tab-delimited table (in 
cluster order) at the Science Web site (www. 
sciencemag.org/featureydata/9M559jhJ). In 
addition, the entire, larger data set for the 
complete set of genes analyzed in this exper- 
iment can be found at a Web site maintained 
by our laboratory (genome-www^tanford. 
edu/serum) (o 1 ). 

One measure of the reliability of the 
changes we observed is inherent in the ex- 
pression profiles of the genes. For most genes 
whose expression levels changed, we could 
see a gradual change over a few time points, 
which thus effectively provided independent 
measurements for almost all of the observa- 
tions. An additional check was provided by 
the inclusion of duplicate and, in a few cases, 
multiple array elements representing the 
same gene for about 5% of the genes included 
in this microarray. In addition, three indepen- 
dent hybridizations to different microarrays 
with mRNA samples from cells harvested 8 
hours after serum addition showed good cor- 
relation (Fig. I). As an independent test, we 
measured the expression levels of several 
genes using the TaqMan 5' nuclease fluori- 
genic quantitative polymerase chain reaction 
(PCR) assay (P). The expression profiles of 
the genes, as measured by these two indepen- 
dent methods, were very similar (Fig. 3) (10). 

The transcriptional response of fibroblasts 
to serum was extremely rapid. The immediate 
response to serum stimulation was dominated 
by genes that encode transcription factors 
and other proteins involved in signal trans- 
duction. The mRNAs for several genes (in- 
cluding c-FOS, JUN B, and mitogen-acti- 
vatcd protein (MAP) kinase phosphatase- 1 
(MKPI)] were detectably induced within 
15 min after serum stimulation (Fig. 4, A 
and B). Fifteen of the genes that were 
observed to be induced by serum encode 
known or suspected regulators of transcrip- 
tion (Fig. 4B). All but one were immediate- 
early genes — their induction was not inhib- 
ited by cycloheximide (//). This class of 
genes could be distinguished into those 
whose induction was transient (Fig. 2, clus- 
ter E) and those whose mRNA levels re- 
mained induced for much longer (Fig. 2, 
clusters I and J). Some features of the' 
immediate response appeared to be directed 
at adaptation to the initiating signals. We 
observed a marked induction of mRNA 
encoding MKP1, a dual-specificity phos- 
phatase that modulates the activity of the 
ERKI and ERK2 MAP kinases (12). The 
coincidence of the peak of expression of 
genes in cluster E (Fig. 2) with that of 
MKP1 (Fig. 4A) suggests the possibility 



1 JANUARY 1999 VOL 283 SCIENCE 



www.sciencemag.org 



that continued activity of the MAP Iriiiase path- 
way is required to maintain induction of these 
genes but not of those with sustained expression 
(clusters land J). The gene encoding a second 
number of the dual-spednciry MAP kinase 
phosphatase family, known as dual-specificity 
protein phosphatase 67pyst2 f was induced lata, 
at about 4 hours after serum stimulation. Genes' 
encoding diverse other proteins with roles in 
signal tra nsduction, ranging from cell-surface 
receptors [for example, the sphingosine J- 
phosphatc receptor (EDG-I), the vascular en- 
dothelial growth factor receptor, and the type II 
BMP receptor] to regulators of G-protein sig- 
naling (for example, NET 1/^115 rho GEF) to 
DNA-binding trajiscriptjon factors, were in- 
duced by serum (Fig. 4A). 

The reprogrammmg of the regulatory cir- 
cuits in response to serum involved not only 
induction of transcription factors but also re- 
duced expression of many transcriptional reg- 
ulators — some of which may play roles in 
maintaining the cells m G 0 or in priming 
them to react to wounding (Fig. 4C). Perhaps 
as a consequence of the historical focus on 
genes induced by serum stimulation of fibro- 
blasts, the set of transcription factors whose 
expression diminished upon serum stimula- 
tion has been less well characterized. 

Genes known or likely to be involved in 
controlling and mediating the proliferative re- 
sponse showed distinctive patterns of regula- 
tion. Several genes whose products inhibit pro- 
gression of the cell-division cycle, such as p27 
Kipl, p57 Kip2, and pI8, were expressed in the 
quiescent fibroblasts and down-regulated be- 
fore the onset of cell division. The nadir in the 
mRN A levels for these genes occurred between 
6 and 12 hours after serum stimulation (Fig. 
5A), coincident with the passage of the fibro- 
blasts through G,. The levels of the transcript 
encoding the WEE) -like protein kinase, which 
is believed to inhibit mitosis by phosphoryl- 
ation of CdcZ diminished between 4 and 8 to 
12 hours after serum addition (Fig. 5 A), well 



Reports 



before the onset of M phase at around 16 hours, 
Wing the possibility of an additional role for 
Weel m an earlier stage of the cell evele or in 
*S**ing the Go to G, transition! Several 

• few 

hours after serum 
nimutation. such as the helix-loop-helix pn> 
terns ID2 and ID3 and EST AA0I63O5. a gene 
with homology to G,-S cyclins. are candidates 
for roles in promoting the exit from Go. 

Genes involved in mediating progression 
trough the cell cycle were characterized by a 
distinctive pattern of expression (Fig. 2. clus- 
ter D), reflecting the coincidence of their 
expression with the reentry of the stimulated 
fibroblasts into the celWivision cvcle. The 
stimulated fibroblasts replicated their DNA 
about 16 hours after serum treatment. This 
timing was reflected by the induction of 
mRNA encoding both subunits of ribonucle- 
ot.de reductase and PCNA, the processivity 
laetor for DNA polymerase epsilon and delta 
Cyclm A, Cyclin Bl, Cdc2. and CDC28 ki- 
nase, regulators of passage through the S 
phase and the transition from G 2 to M phase 
were induced at about 16 to 20 hours after* 
«™m addition. The kinase in the Cyclin 
Bl-CDK pair needs to be activated by phos- 
phorylation. The gene encoding Cyclin-de- 
pendent kinase 7 (CDK7: a homolog of Xt- 
nopus MO 15 cdk-activating kinase) was in- 
duced in parallel with the Cdc2 and Cdc28 

EI^J? 8 " 5AX su 88 es,in S ■ P°<«itial role 
tor CDK7 in mediating M phase. DNA topo- 
isomerase II a, required for chromosome seg- 
regation at mitosis; Mad2. a component of 
the spindle checkpoint that prevents comple- 
tion of m.iosis (anaphase) if chromosomes 
are not attached to the spindle; and the kinet- 
ochore protein CENP-F all showed a similar 
expression profile. 

In die hours after the scrum stimulus, one of 
me most striking features of the unfolding tran- 
scriptional program was the appearance of mi- 
merais genes with known roles in processes 
Levant to the physiology of wound healing 




100.00 r- 



^ bod, genes involved m d« & 

role played by fibrobtea in renwldmg of 
*<jlo, and the extracellular rj^^l 
~*Wv. genes encoding ptotea, nmrfved in 
mtercellular signaling (Fig. 5). Genes induced 
«n ** program encode products that can ffl 
pamnpate in the dynamic process of clotting, 
clot dissolution, and remodeling and pertaps 
contribute to hemostasis by promoting local 

^^"T ffor oum P ,e > «KteheJin-l); 
(gemote chemotaxis and activation ofneu- 
mjh'Mfor «ample. COX2) and recruitment 
«nd extravasation of ^ m ^ 

cS.™ ( eX , ample ' MCP, > : < Hi > P"»»«e 
chemotaxis and activation of T lymphocytes 

for example, interleukin-8 (IL-8)] and B 

lymphocytes (for example, 1CAM-I), thus 

providmg bom innate and antigen-sjeciftc 

defenses against wound infection and recruit- 

">g he phagocytic cells that will be required 

to clear out the debris during remodelmg of 

ntv^ 0V) Prom ° ,e and 
neovascularization (for example. VEGF) 

*£»8h newly forming tissue; (v) promote 
migration and proliferation of fibroblasts (for 
example. CTGF) and their differentiation into 
myofibroblasts (for example. Vimentin); and 
(vy promote migration and proliferation of 
kemnnocytes leading to reepithelialBation 
of the wound (for example, FGF7), and pro- 
TLu ToMemwn ° f melanocytes, perhaps 
contributing to wound hyperpigmeraauon 
(for example, FGF2). 

Coordinated regulation of groups of genes 
whose products act at different steps in a 
common process was a recurring theme. For 
example, Funn. a prohoimone-piocessina 
protease required for one of the processinf 
steps in the generation of active endothelm 
was induced in parallel with induction of the' 

«- all A/CD 10. a membrane metalloprotease 
that degrades endothdin-l and other peptide 
mediators of acute inflammation. waTre- 
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Fig. 3. Independent verification of microarray quantrtation. Relate* «pm* 

assay (9) (left) in the sbtk ^rnptes that w£ e us^Con^ ^ 
miooar^ hybrid,^ (^Data mTtKa^OTB^ 



Time 




normalized to mRNA concentrations and plotted retativ* t« ^ , ^ 
time zero, so that the results could ^co^^T^J^ * 
mrcroarray hybridizations. In general Z^^^^l^ ** 
gave very simitar results (70) ^^rtat^n with the two methods 
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duced A second example is provided by a set 
of five genes involved in the biosynthesis of 
cholesterol (Fig. 51). The itiRNAs encoding 
each of these enzymes showed sharply dimin- 
ished expression beginning 4 to 6 hours after 
serum stimulation of fibroblasts. A likely ex- 
planation for the coordinated down-regula- 
tion of the cholesterol biosynthetic pathway 
is that serum provides cholesterol to fibro- 
blasts through low-density lipoproteins, 
whereas in the absence of the cholesterol 
provided by serum, endogenous cholesterol 
biosynthesis in fibroblasts is required. 

Many of the previously studied genes that 
we observed to be regulated in this program 
have no recognized role in any aspect of wound 
healing or fibroblast proliferation. Their identi- 
fication in this study may therefore point to 
previously unknown aspects of these processes. 
A few selected genes in this group are shown in 
Fig. 5H. The stanniocaicin gene, for example 
(Fig. 5HX encodes a secreted protein without a 
clearly identified function in human cells (J 4, 
/•?). Its induction in serum-stimulated fibro- 
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blasts suggests the possibility that it may play a 
role m the wound-healing process, perhaps 
serving as a signal in mediating inflammation 
or angiogenesis. 

One of the most important results of this 
exploration was the discovery of over 200 pre- 
viously unknown genes whose expression was 
regulated m specific temporal patterns during 
the response of fibroblasts to serum. For exam- 
pie, 1 3 of the 40 genes in cluster D (Fig. 2) have 
descriptive names that reflect their putative 
funcnon. Nine of these 13 genes (69%) encode 
proteins thai play roles in cell evele progres- 
sion, particularly in DNA replication and the 
G r M transition. This enrichment for cell 
cycle-related genes suggests that some of the 



™^»cnes m this duster--for example. 
JWWWH and EST R 13,46, neloWTf 
^hich have sequence similarity to previously 
characterized genes-may n*fes« previorfy 
unknown genes involved m this part of me ceU 
cycle. Similarly, a remarkable fraction of genes 
that were grouped into cluster F on the basis of 
their expression profiles encoded proteins in- 
volved m intercellular signaling (Fig. 2X sug- 
gesnng that a smular role should be considered 
for the many unnamed genes in this cluster A 
dispropomonatery large fraction of the genes 
*1wse transcription dirmnished upon serum 
stimulation were unnamed ESTs. 

Our intention was to use this experiment as 
a model to study the control of the transition 




Fig. 4. "Reprogramming* of fibroblasts. Expres- 
sion profiles of genes whose function is likely to 
play a role m the reprogramming phase of the 
response are shown with the same representa- 
tion as in Fig. 2. In the cases in which a gene 
was represented by more than one element in 
the microarray, all measurements are shown. 
The genes were grouped into categories on the 
basis of our knowledge of their most likely role 
Some genes with pleiotropic roles were includ- 
ed in more than one category. 
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from Go to a proliferating state. However one 
of the defining chrocterisrics of genome-scale 
expression profiling experiments is that the ex- 
amination of so many diverse genes opens a 
on all the processes that actuafly occur 
andnotmerery the single process one intended 
to observe. Serum, the soluble faction of clot- 
ted blood, is normally encountered by cells in 
wo in the context of a wound Indeed, the 
oppression program that we observed in re- 
sponse to serum suggests that fibroblasts are 
Programmed to interpret the abrupt exposure to 
scrum not as a general mitogenic stimulus but 

* * physiol °S ical si P»t signifying a 

wound. The proliferative response that we orig- 
inally intended to study appeared to be part of a 
larger physiological response of fibroblasts to a 
wound. Other features of the transcriptional 
response to serum suggest that the fibroblast is 
an active paniciparc in a conversation among 
the diverse cells that work together in wound 
repair, imerpreting, amplifying, modifying, and 
broadcastin « signals controlling inflammation 
angiogenesis. and epithelial regrowth during 
the response to an injury. 

We recognize that these in vitro results 
almost certainly represent a distorted and in- 
complete rendering of the normal physiolog- 
ical response of a fibroblast to a wound 
Moreover, only the responses elicited directly 
by exposure of fibroblasts to serum were 
examined. The subsequent signals from other 
cellular participants in the normal wound- 
healing process would certainly provoke fur- 

of ** 'ranscriptional program 
in fibroblasts at the site of a wound, which 
this experiment cannot reveal. Nevertheless 
we believe that the picture that emerged 
strongly suggests a much larger and richer 
role for the fibroblast in the orchestration of 
this important physiological process than had 
previously been suspected. 
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We used cDNA microanfays to exoJore the variation ;« . * 

as their doubling time in culture, drug metabolism o,th. hU : w ^S!«i. properties of the cell lines, sudi c 
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Introduction 

Cell lines derived from human tumour, have been extensively used H^HTl phe "° typic "J*** ° f «•* ceUs «"d tissues studied*-", 

as experimental models of neoplastic disease^tK^ch S e*17«7„ Tv^ observed panerns of gene 

lines differ from both normal and cancerous SSS^£ ^ " rebt,on$hi P » Phenotypicpropertie, ofthe 

bility of human tumours and normal tissue im£?E5L wlnT aaaa !W a * «Pon" explores the relationship 

such cell lines will continue to be used as experimental models fo f£TT 5? e ?'™ mn P atlerns and * e dru 8 sensitivity.pro- 

the foreseeable future. The National CnS^SSSSS^ , *1 ^ 9™ The ^"ent of gene expression pat- 

mental Therapeutics Program (DlP^aSl S ST" ' "i?" ^ *"* ^ such * * e di ~~ ~ 

studies of 60 cancer cell lines (the NCJ60) derived fern ? "Sf 1 "^ ^ Under diverse conditions in vim, and 

fromavarieryoftissuesandorgans^ m vm,. should lead to increasingly detailed maps of the human 

molecular features of the cells related oc^Z^d^t^ *T ?T Pr ° gram and pr ° vide due$ » to * e PhysiologS 

peu tic s^sitivity. and has measured m£££J!£%£ ^ 

lines to more than 70.000 different chemical comDom^Z^T !2T , visuabauon of the data, are available (httpV/genome- 

ing all common chemotherapeutics ^«JSw1 A WWS,anford -' du/ "ci60a„d hnp://discover.nci.nih.govr... 

pnrvwus analysis of 'these data revealed a connection between the Results 

panern of activity of a drug and its method of action Inoartimlar w. ,„.a- a 

^^^y^Atr^for^p,^^^^^^^ Z s,ud,ed 6 ent ^xpress.on in the 60 cell lines using DNA 

actrvirytolmerebwdmeAcJrofaaior?^ s,m,lar P 8 "™ <>' ~ r ">* P«P««I by robotically spotting 9.703 human 
We used DNA microamys to survey the variation in a h„n °" m,c /° SC0 P e slides' 7 '". The cDNAs included 

dance of ^art^fcJS o?nscnpJ in these ESS? I > l*"™ 

60 cell lines. Because of the logical connection bS£K££Z ?Z hTh? \ charaC,m2 f d human P™**- ™ additional 

tionofageneandiupanemofexpression.thec.Son^ I homologues m other organisms and the remaining 

expression panerns with the var^ion in fcSSSSfiS uwTcSS d ° * °" l ° ambigUity ° f ^ iden 

can begin the process by which the function of Teene can be Tl I ?^TJ M * d ' hKe Mudies ' ** «™ted that 

inferred. Similarly. *, panenu of ap«S£o4^eSl K" ti°^° f !, he ^ " ,htte "V"^" were cor- 

F 01 ltnown 8enes Mn rectJ V 'dwnfied. The identities of approximately 3.000 cDNAs 

Pr^am. IXviwm «fOi«er Twmtm ^DinMkti^C^fr^' T' ^/'"/""'""'O" Technology Branch. Development*! The,aptutits 
Medical Institute. sLjM U*i^SdJ%5 t )*^^ '"~ ^ Hefl " h - *^' fl '- *<«>*i"4 USA. 'Howard HuZ 
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Hg. TGetie expression patterns related to th# int..* *u ^ " 

«o» 64 cell linel The . J 6 J^," '„ ™' "V* '' 16 ' t0N * 

te»h ttot varied by .t lean OoZESSS^ ^ll""!? """"W 

ten 4 of 60 cell lin«. This effect*,, ^ed ^^h tl"' '° "* " ,e,ent « P°° lin " 
»ion level .„„„ ,„, to cell line. (iJudinTMy^^^Tl! " ,<m in 
««e pool), lno thtfe<0( , highligh,* tW^™ "P" 1 ""* rri «- 
distinguished the cell Kn« from w .nmher D«. p * nerr " thai ben 

♦or each cell line p lu , the two w^rT^o^T? " •"*"**"*>'» we u-rd. one 
«"« K562 end MOT. The two cen °n„ 2EZ£? ' e| " e,enu,i "» °< of the cell 
7*"™ «" «* 9«n. dim.*, „ .hTeJcTonheK ^'£lT """Ponding^ 
clustering. a, The celWirw dendrogram, with the tWmmi nt " tnnt1 > «•»"•"» to the 

«t«»ib.. tissue of origin of the Tell *? Mm Colou,M » «"•« the 

Ph. pron.,,; light blu? Iun»o,.nge/o« ^X^fr n ' ZZ* brM * 
buck, unknown WCUAOR-M$». T^ale,^*^ EEST*, 0 ** b,0wa ■"•"""•J 
relation coefficient represented by the JZ%*, IX" 1^1 " t '"" 0 S"' n «*Picts the cor- 
P»r» of nodes. Not. th.1 the twe 7vZte**%£J£'^!^ m b " nches connecting 
tiohtly ,og«h., .nd were well dm«,^,to f^Jll .Ji " ,nd MC "> <""« 
indicating th« thi, clustering of ceinfrT^XTr £ m .°" '«'•'«« imes. 
expression partem ,„„,, t h.„ ,„«,,„, 0< , ' ."™.",'.r in thei, g,„. 

representation of the on. table. with the rom |oe^ i^T if' 01 **"" *• A coh »"«» 
order. The dendrogram repressing hX£ fSSLS^T 1 (< "' l ' n " ) in *"» 
ted for clarity, but is ...liable (rntp7«r»r^^^?^ 9en " w " •«*- 

cell of this table reflects the meerMdhmed i„7-.ZZ"l™ "r*' 60 '- Th « "'our in each 
(column). The colour sc.le u>M to Cetm , ~™ *?. (,ow) * na <«" «"« 

3-3d" in (b) reta ,0 th, clusten erf cZT^nZ^F,,. 3'" " ^ The 
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cell lines 



J from these experiments have b«n scauen<-r>.v»r,fi-^ • 1 j. 

* aU of those referred .0 here by n^e.^ Uldud,nB 

I «S tran f cribed ^ a - reference mRNA" 

S ^t t^ ^ Sa,nple • » a" hybridiiations, was 

S ESS y T b,ning an equal mi « ure of ^ fr 0 ri 2 7f 

« the cell hnes (chosen to maximize diversity in o» n . 

determined prim arif y from rwo-dinT2„7^ 'iSS?* 
companng cDNA from each ceD line with a coLJn refoence 
SK" «S" acrc " A* 60 cell linelS S 

mferred from the observed variation in the normalized Cy5/Cv3 
ratios across the hybridizations. ^-ya/^yj 

To assess the contribution of artefactual sources of variation in 
*e «penmentdJy measured expression « ° 
MOTccU Jhne. were each grown in three independent culture/ 

Z ^ 'T' ^ 7" " nK6 0W i^ependently on M 
exacted from each culture. The variance in the triplicaXo 
rescence rauo measuremen« approached a minimum when the 
fluorescence signal was greater than approximately 0 4% „f k! 
me«ur.ble total I signd dynamic rang? SJ ScJg^uSd t 
«th« channel of the hybridization. We selected the^ubs« of 

he l^t " Om ^ ,0r ° f rati0S ^ th " "i'«ion to idenufy 
Jr th^"™.~ Ured /'! 0,i - ^ coefficienu 

Snevei fi£ " ^ ? ^ ,h8 * P aSSed this «J«al«y co" 
trol level (6,992 spots included for the MCF7 samples and 6 161 

spots for K562) ranged from 0.83 to 0.92 (for graphs 2 ^detail 
sechrtp^/genome-www^tanford.edu/nci60) 

To make the orderly features in the data more apparent we used 
a hierarchical clustering algorithm'™ and a pseuTc^uTv^ 



genes whose expression level varied™ uSe 60 oeU 

(F«g. ), we concentrated on those genes that showed T 
variation in expression among the 60 ceU W f I t ^ n \ 

measured in the reference set (6,83 1 spots). 
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wlrh^mo? 1 ' ° f ,hC C,US,ered da,a «« ««»« «U lines 

central nen-ous system, colon, renal Sovatni 
ered .mo independent terminal branches specinc to tneT™ 
«'ve organ types with few exceptions. CeU lines derivedTn," 
non-small lung carcinoma and breast tumours werVd^K .^ 
» multiple different terminal bn^^S^S^ 
expression patterns were more heterogenous. § 

Many 0 f these coherent cell line clusters were distinguished bv 
H. """"T ° f characteristic groups K 

(Fig 30-rf). For example, a cluster of approximately 90 eeni wH 

was enriched for genes w,th known roles in melanocyte bioloev 
indudrng tyrosinase and dopachrome tautomerasMTYR and 
2 ,W ? J * U l b " n " s ofan complex involved in melan"n 

a«e. for mm ^ (MU ? A: Which » bein 8 -vestigaTeS ft 
S fo a V mmuno,h " a Py of melanoma") ,„d SlOO-P (SJ00B; 
which has been used as an antigenic marker in the diagnosb o 
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."9.,2-Gene expression patterns related to 
other ceJUine prwrtotypes. a. We applied 
twoHJimeraionaJ hierarchical dustering to 
expression data from a set of €431 eONAs 
m»*ur«d acres the 44 cell lines. The 
<D*A* were those wtth a minimum fluores- 
cence *onel intensity of approximately 0.4 V 
of the dynamic range above background in 
the reference channel in each of the six 
hybridizations used to establish reprodudbft- 
Thb, effectively selected those spots that . 
provided the 'most; reliable /ratio measure- : 
ments and therefore identified a subset of 
genes useful for exploring patterns comprised 
of those whose variation in expression across 
the 60 cell lines was of moderate magnitude, 
*>, Cluster-ordered data table, c. Doubling 
time of cell lines. Cell lines are given in duster 
order. Values are plotted relative to the mean. 
Doubling times greater than the mean are 
shown in green, those with doubling time less 
than the' mean are shown in red. d. Three ; 
related gene clusters that were enriched for. 
genes whose expression level variation was 
correlated with cell line proliferation rate. 
Each of the three gene dusters (clustered 
solely on the basis of their expression pat- 
terns) showed enrichment for sets of genes 
involved in distinct functional categories (for ' 
example, ribosomal genes versus oenes 
involved in pre-KNA splicing), e. Gene cluster 
in which all characterized and sequence-veri- 
fied cDNAs encode genes known to be regu- 
lated by interferons. f t Gene duster enriched 
for genes that have been implicated in drug 
metabolism (indicated by asterisks). A further 
property of the gene clustering evident here 
and in Fig. 2 is the strong tendency for redun- 
dant representations of the same gene to 
duster immediately adjacent to one another, 
even within larger groups of genes with very 
similar expression patterns. In addition to - 
illustrating the reproducibility and consis- 
tency of the measurements, and providing 
independent confirmation of many of our 
measurements, this property also demon- 
strates that these, and probably all. genes 
have nearly unique patterns of variation 
across the 60 cell lines, tf this were not the 
case, and multiple genes had identical pat- 
terns of variation, we would not expect to be 
able to distinguish, by clustering on the bash 
of expression variation, duplicate copies of 
individual genes from the other genes with 
identical expression patterns. 
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higher.]evd orgaiuzation of the ceflE^' 1- . W»"»*> relation to cell prcJi&S? " du «er w» 
g~W ceflli„e^ thesecelliines, , - ^.^^ or gmwth .ate in 

shared biological .properties ofthe tissue fZTj? ° k "? ec,ed ln a sm aB« gene cluster (Fit Ml a ii f .v 

•lines were-de^^n^on^eE ?£Z ^ previouri )' to ■SSSedt? £?* ^^wcre 

»«o major branches, that sepaSedS*e S^L^Jr^ ^^-SS5^2 ,a *2T^ Additional 

. charaaeristic of epithelial E *Z 2? gmes <*P™ion (data mi-SSmfrJZZ^T* J*™ 0 P™"™ of 

genes in this dusterhave been mpuS^^JT^ ^"S^^VM^lS^^ metabolism, 
epithelial ceD bioloey"; The dusttr »»«-!!• Mpwts of s,w e for the rate limhW «Ir.!"/f ^7°* ^ enzyme respon- 
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epithelial c^Jud^t^ ^^ membnn ^f ■ J*™™* enzymes 
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periplakin (PPL^d %2fi f (DSP) ' iA l£S ttM!p ° n gl»tathione-co„j u ! 

-express ceU^ adherio^jSe (S .5 ^^SdsiiKSrlSS^^^"-" 

hydrogen ion exchanger*-" (SLC9A1 ) H chemotherapeutics. X Se,ecUon for Stance to 
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phogenesu, a human homologue of7K>Wl7 , m ° r " Ce, " in « facilitate interpretation of „. 

epithelial-expressed rumour s-Ppre^r^ ^hXoT ,T" W C ° mp ' e * £ 9606 "P""* 0 " 
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noma-derived lines. The charaaerS ^ of JeneT ^ «P ressio » P""Sta C s£c£e?* "'"P™" 8 *< 
this duster induded many vAosewoulL J£ m iso,awd ^ ^o breast Ser biol ' « > C ° mpared ^ 

mal cdHunctions (Fig. 3^. J^S^^SSZ ^ ^~^Cf!fcS!£l 
desenbed as •sarcoma-like' in appearance (HsS7«T Zl ? y cancers <« £ )«ding MDA-MB-435 and M n a wTj i m brean 
nosarcoma, and SFS39. ghosarcomal e»rS brea$, r carc '- (% 4). This clustering hSSiSed fl if ^ ^ Ieukae ™« 
gene^^. Although no smgle g«e waslES T ° f ,heSC sion P attffn sha ^ bSS S Sllf, ° f the 8 * ne "P 1 * 8 " 
of this duster. JJ cell iS^J.-tSS^'SS'S ^^^^SSaE^ 1 ^ 
expression of genes encoding proteins witk m i J • of g ene s encodinc keratin 8 IK bt*\ ' j l ™ 
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(1X)X) and colkTen subtypes). Sugh to^^SZ" and ,hf «*» WdSS S s MC M 5 fc 
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teristic of both ephhelial cells and s^rr^cX tE n^M " f CriplS ° ri 8 ,na,f d in '"".our ceU StnSS * i^,' ^ tn,n - 

Physiological variation reflected stromal-like cell lines H S 576T andl^?S"^t 

m gene expression patterns of tne expression pattern seen in the tumn , Th,s ,.f ea,ure 

A duster diagram of£l genes (Fig 2) is useful fi, , ** 6w 10 lht corZnem S^um""^ " ^ ,0 

clusters of genes whose variauon £1°' "P' 0 "^ also expression of aTof I „« (Fi^^T';^ ,Um ° Urs 

ousry attributable to cell or tissue r^nTW^S^ "* ° bV " n,yelon,a ceU «"e (S»m!S« * I ^ m ^ pk 
^t were en nch ed f 0r S^^'^Z JTE^^ A ^ 

our (mis was confirmed by staining with anti- 
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3 Gen« dun.n ,e Ut ed to tisiue char.nerislia in the cell li«« 
'or genes e«preued in cell lines of ostensibly limiu, »I2L "Uroements of the regions of the cluster Hi»„, « m c , 

? en« that ^re .jessed in rnosl leuk.emi:^"^ L^Z" ^ ^ " p '«"<* "h?*^?^^ ^ ° ene '-«hed 

fons duster together), t. Cluster of genes Wgl»££j^ e * C,UiiV<,y ln ,he ^rob.alto dtne r 5« ^.1"° lUO< ' Ui,e " di » in 9uish 

set of genes was also moderately expressed in m«tl CO, °° a/7) ce " ,in " * n <* •» breast^erivLd Z »f' 2 <n ° ,e that ,he <'iP'*c«e hybr«i2a- 

cer^erived .ines. c. Cluster of gene^y lmn M) * nd ^^'^l^^rS T P ° l ' t,Ve ^ ° eSUO ° en 

moderately expressed in a subset of carcinoma 7 1 0,,ob,a » om » «*> lines and most line FuZ * U ° m b,MM <M0A. 
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«9. 4 Comparison of the gent «urtuion n.tu.._ » . ^_ 
;SPtei*,« and cultured br«M <*Z^J2ZZ 'V?!? 1 bfMtt «•"»» 

•nd the NCJ60 bf.« ln d>ut«r»«» f nwd^i^l^r t W> "~' bww 

•ubm of th. NOW cell «„„ lo^^y^Tj^T ^* « 
tern, observ* in sp«if k line. co^lieSta SZ!^ «*Pr~io« ««• 
indicate gene clun. n Ohown in deuH « rTS^I? Ubrt » 
celhrt., component, o^j^l^li' ^*l m ^ b * » ipecffic 
fined wrth .mi*^SSSS^ «P«cim« .6 

character**..*' fenmd m *JJS»tn2 l^TEE'S' ^ t *" ^ 
caJMar component, of thb thJt^Z^JT*" h *.'* 9ht *• dl ""«"« 
•ene «p,«,ion c W .nehrt^^T *■»• **"9«i»ha<. by the 
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mrnunoglobubn amibodies; data nor shown). Therefore, dis- 
tinct sets of genes wrth co-varying expression among the samples 
(Fig. 4, arrow) appear to represent distinct cell types that can £ 
distmguished » breast cancer tissue. A fourth dE?of «n« 
more highly expressed in all of the cell lines than in any Re 

eration duster described above (Fie hd\ Th* 
expression of ^^^^g^Z?^ 
eration rate between the rapidly cycling cultured cell lines J£$L 
much more slowly dividing cells in tissues. ^ 

Discussion 

Newly availablegenomics took allowed us'to explore variation in 
gene expression on a genomic scale in 60 cell lines derived from 
diverse tumour ussues. We used a simple duster analysis to idT 
ufy the prominent features m the gene expression patterns that 
appeared to reflect -molecular signatures" of the tisauTfrom 
which Ae cells onginated. The histological characteristics ofSe 
ceD h„es tlwit dommated the clustering were pervasive enougE 
that smular relat.onsh.ps were revealed when alternative subsS 
of genes were selected for analysis. Additional features of tS 
expresston pattern may be related to variation in ph^Tological 
attributes such as proliferation rate and activity of ilZ! 
response pathways. "ucneron- 

The properties of the tumour-derived cell lines in this study 
have presumably afl been shaped by selection for resistance to 
host defences and chemotherapeutics and for rapid proliferation 
mAet^eculture^^ 

•T?* * P 0 ^™ Stratum. But the primary 

identifiable factor accounting for variation in gene ex£es2 
panerm among A «e 60 cell lines was the identity ofSSS 
from whtch each cell line was ostensibly derived,For most ofZ 
cell lines we examined, neither physiological nor «p£ mental 
adaptauon for growth in culture was sufficient to ™2 

,hr„n7 T'^ Z T^" of mes "chymal features in 
the cell lines isolated from glioblastomas and carcinomas maj 

£™ $e,etU ° n f ° r ^ re,8,iVe MSe of ««W"hmen, of «J 
lines expressing stromal characteristics, perhaps combined with 
physiological adaptation to tissue cultureVondhions^o 



; known genes withm the dusters. Uncharacterized cDN a, ™ 
Jkely to encode protein, that have mlStttSS £ 

^ h "«p*»?<*m ^^Sletodi^SS" 
mon theme linking, the identified members of the dZfi>2 
cxplorat,o„ of their variation in expression - J5?5 dh£l 

express specific sets of genes in the histolog^y com pl « breal 
cancer specimens-. Although i, is now feasible"* ^r«e 
expression in micro-dissected tumour specimens^ " thu^K.T 
vat,on suggests that.it will be possible ^S™^.! 
some of the biology ofclinical SmJJ^^J^SSZ 
muct As is useful in conventional morpholoSSfofy o^e 
might be able ,o observe interactions between a Sur Sd ^ 
microeriv.ronment in this way. These «latiomW«^bedln 



Methods 

cDNA done. We obtained ihe 9.703 human cDNA clones (D— r 
.cs) used in thee experimmU as banerial coloiril inll^? GtnW " 
IW. App,o«ma,ely 8.000 dminc, ^^^^l""^^ 
nally u„ ipu . gfno) were represented cWaS 7™' 

ned here by name represent clones whose idem.L we^'^fi^ k" 1 ''" 
«quenc,ng. or by ,hc criteria ,h„ rwo or ¥ 
os^ibly representing the same gene mZ^Sm^^T 1 
patterns. A single-pass 3" sequence wrtoJl™*^^?™" 
clone after re-.treaking for single cotenies. SSfaS 

3 sequencing. Of ihc subset of clones selected for 5' $«uen« vrrifi..? 
^'t^We'es^ 

DMA representing more than ~££™£l?££%SF* f 
-3.000 clones have been verified. The fuO toof c ndfhri nolf 

(Stanford Identtfication) represent clones who*, identhie, hT^o. yet Sn 
vermed; http^/genome.www.«anford.ed U :8000/nci60) 
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I?* JL nr 5L p , " mtds "P'wenied in the clone set ( 5 '-TTCTaaa Ar r a nr 
GCCAGTG-3', 5 -CACACAGGAAACAGCTATG^VS^^ 
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for Memhtag and opening • microarriy primer, andprinti™ ,„d «»T 

JVqwition'of mRNA and referencr pool. Offline ■.'•^L v,„ 

*»ch oeD line to 80% confluai, ShS^aX^^"' 
buffer containing guwidium i»ihiocyana,e and ^™?™ h ^ m - 




■•apt t4M».in 

5? V'SgJJJ, " ,a * T0 » "TOwcibc ' 



using i po)y(A) purifiation kit (Oiigoirx Oiitm) . « 

ine bretjt tumoun were surricalk «ri«~i t» . 
imported ,o the P^Uk^SSS^ZlZT 
^ were ^ frS, ^ 

increasing «o -20.000 r.jui M penrfof ^ £ 

iniiial «ep .o renX f, L Or^a^ ^Z^™ 1 T^*"* ™ 
with , Fa«Track 2 0 kit (Invitmo^w ^ obllu, « J - we *>l«<«i mRNA 

*>i«ing mRNA ^3^^^^'* "T^ * 
obuinedfromOomeeh: . ' ^ ™ enonM,bre «'»'npl«%«w 
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We combmed mRNA from the following cells in equal quantities to 
^ ^f^f 1^1: ^ (acu,e'mytlo. d leukaemia) and K562 
(duomc nrydoad kukaeini. ; NCNH226 (non-snuU-ceJl-lung); COLO 
^ ( ^ n); f^f* 19 <« nlraJ «w»ui .y«em); LQX-IMV] (melanoma)- 

MCF7 and W7tp- n>««t),'n 1 eentenon for selection of the ceD lines in 
the reference .re;descxibed rn^ilin me accompanying man JcriptT 

isisffi d ° uwin8 limtt ^ « 

^NO60 <«8 Jmc ^ compound screening data; and they reflect the dou- 
bhng ume* forceHs mocul-ted mto^weil plates ,t thesoeening inocula; 
gn . oW, and grown « RPM1 1640 medium supplemented^ 5% 
bonne *rum for 48 h. We measured cell populations using sulforho- 
damme B opucaJ density measurement assay. The doubling time constant k 
wascalouated using the equation: N/No. = where Nol ^S 

«fcr«-h mcBteipQ, and It »■* h. The same equation was then used with the 
derived k to calculate the doubling time t by setting N/No £'Z For a given cell 
^. we obtained No and N values by averaging optical cfensiues (NXtfOO) 
obtaine^ for each ceJfcne forayear . screening. Dau and experimental details 
air available fhttp-7/drpjid.nih^ov). ;• 



•t hwp-J/ranajunfoid.edu/ioftwire). Each spot wai defied K. . 

phoning of, grid of drck, cm the .n^LgT^tS 

"i^ 10 W " f COmpU " d fa »P°' «^ to ZmX^xd 
mtenuiy m a square of 40 pad, in width and height earned on the «k 
• «n.re. excluding til pixels within any defined ,poTn« 
, mu,ed by M bu,cUon 0 f jh« local background EL ,£ 
of j£ TJ* SPP " dwm * doniuiMUe 'or accurate quanuu.L^ecaC 
££L T n,g?td lnd «1^ from further analT 

^ g r^ r, " d * W,yle ^ cmrred into a ciwom datable 

unage, were normalized by applying a uniform Kale UcZxort 
m.ensme, mosured for the Cy5 channel The normdization W was 
chosen so tha, the mean log(Cyi/Cy5) for .' subset of spots that .chfeved? 

Kg - -* array ,0 



Preparation and hybridization of fluorescent labelled cDNA, For each 
comparative array hybridization, labelled cDNA was synthesized by reverse 
transcription from test cell mRNA in the/presence of Cy5-dUTP and from 
the reference mRNA with Cy3-dUTJ>, uring the Superscript II reverse d 

2 Mg) was mixed with ,n anchored oligo-dT (d-20T-d(ACC)) primer (4 
n) ui a totd volume of 15 ul. heated to70 -C for 1 0 min and cooledTice 
To this sample, we added an unlabelled nucleotide pool (0 6 ul- 25 m M 

aUTP(3 uj; 1 mM; Amersham), Sxfirst-strand buffer (6 ul- 2SO mM tv;. 
HCU pH I J. 375 mM KO. 1 5 mM Mgd,). 0.1 M SSRSiSi Jfc 

£ the UNA was degraded by adding 1 N NaOH ( 1 J M l, , nd incubating a, 
70 c tor 10 mm. The mixture was neutralized by adding of 1 N HCL f 1 J 
ul)..ndthe^volume brought to 5M ul wimTICWmMtris. ImMEDTA) 
Wt added Cotl human DNA (20 pg; Gibco-BRL), and purified U,eprobe 
by centrifuganon in a Centr.con-30 micro-concentrator (Amicon) The 
two separate probes were combined, brought to a volume of 500 ul , n d 

CO ? C , C r!nlw '° ' VOlUme rflt8lta ' Ml- V* added 10 ug/ul 

po^A) UNA (1 Ml; Sigma) and tRNA (10 Mg/Mli Gibco-BRL) were JSS 
and adn.«ed the volume to 9J Ml with distilled water. For find vrob, 
preparauon. 20xSSC (2.1 ul; 13 M Nad, 150 mM NaOtrate dH 8 01«£ 
10% SDS (0.35 Ml) were added to a ,o,d find volume of K S ^e 'lte 
were denatured by healing for 2 min ,« 100 «C incubated at 37 Z 
20-30 nun. and placed on the array under a 22 mmx22 mm glass coverdip 
We mcubated sl.de, overmght at 65 "C for 14-18 h in , custom dide clum- 
ber with humidity maintained by a small reservoir of 3xSSC Arrays were 
washed! by submersion and agitation for 2-5 min in 2xSSC with 0.1 % SDS 
followed by I XSSC and then O.lxSSC The array, were "spun dry" by ceo-' 
tnfugat,or. for 2 min in a dide-rack in a Beckman GS^ ttWetop cemrifuee 
in Microplus carrier, at 650 r.pjn. for 2 min. 



ZkZZ t?T w Wbte ,row, of ^ column, of indWdud 
mKroamy hybridoations) of normalized fluorescence ratio, from medau^ 
base. Various sdecuon criteria, diseus«d in relation to each dau Z *1 
applied to sdect subseu of gene, from the 9.703 cDr4Adem«^T 
arnry, Before dustering and di^. *16j£-£?£Z£££ 
cencerauos for each gene were centred bysubtraoing theariSmeTof 

mdependem of the amount ofoch gene', mRNA in the rde^od 

We appLed a hjera^hical clunering algorimm «p»a,eh>^ndl li™, 
and genes using the Pearson correlation coefficient the measure of sM. 

S d"d ' V " ,ge T dut ' CT »8 , - ,W '- ^ resulu oMhTprU^re 
two dendrograms (trees), one for the ceD line, ini otu for m ; ~~ "* 

»h>ch very similar elements are connected by short bnnrh JtIS^" 
display the row, and column, in the initial data table were reord«J^o 

^uZrTi ******* ^S," 



Array quantitation and dau processing. Following hybridiiation, arrays 
were scanned using a laser-scanning microscope (ref: 17; hnp://cm g m 
si.nford.edu/pbrown). Separate images were acquired for Cy3 and CvS We 
earned out data reduction with the program ScanAlyie (M.B.L, available 
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